Contraceptive vaccine

ABSTRACT

PCT No. PCT/US95/07295 Sec. 371 Date Dec. 19, 1996 Sec. 102(e) Date Dec. 19, 1996 PCT Filed Jun. 6, 1995 PCT Pub. No. WO95/35118 PCT Pub. Date Dec. 28, 1995

The instant invention is drawn to a sperm surface protein in substantially pure form selected from a human PH30 beta chain protein and a mouse PH30 beta chain protein. Such proteins are useful as contraceptive vaccines in humans and mice respectively, and for identifying small molecules that will disrupt sperm-egg interaction and fertilization.

This application is a national stage entry of PCT/US95/07295 filed Jun. 6, 1995, which is a continuation-in-part of U.S. application Ser. No. 08/264,101 filed Jun. 20, 1994, now U.S. Pat. No. 5,693,496.

FIELD OF THE INVENTION

The present invention provides sperm surface proteins and DNA sequences encoding the proteins which are useful in the prevention of fertilization. More particularly, the cloning and characterization of the mouse and human PH30 beta chain genes, as well as their use as contraceptive vaccines, are described.

BACKGROUND OF THE INVENTION

Four methods of family planning are currently available in the U.S.: sterilization, abstinence, abortion and contraception. Of these four birth control methods, contraception is the most widely utilized. Despite the substantial U.S. and global demand for contraception, the presently available methodologies fall short of market needs. Oral contraceptives and barrier methods dominate today's contraceptive market but have significant shortcomings. Oral contraceptives, though efficacious, are documented to be associated with significant side effects, including increased risks of cardiovascular disease and breast cancer, and are not recommended for women over the age of 35. Barrier methods, while safe, have failure rates approaching 20%. There is a clear need for increased availability of and improvements in contraceptives that offer superior safety, efficacy, convenience and acceptability and that are affordable to women and men worldwide. Identification of novel approaches for controlling fertility is therefore necessary.

Immunization of male and female animals with extracts of whole sperm is known to cause infertility [Tung, K., et al., J. Reproductive Immunol., 1, 145-158 (1979); Menge, A., et al., Biol. of Reproduction, 20, 931-937 (1979)]. Moreover, men and women who spontaneously produce antisperm antibodies are infertile, but otherwise healthy [Bronson, R., et al., Fert. and Steril., 42, 171-183 (1984)]. Although the critical sperm antigens are unknown, these observations have led to the proposal that sperm proteins might be useful in the development of a contraceptive vaccine.

In mammalian species, sperm proteins are believed to have a role in sperm adhesion to the zona pellucida of the egg. The PH30 protein is known to be involved in sperm-egg binding, and antibodies that bind to PH30 inhibit this interaction. PH30 is an integral membrane protein present on the posterior head of sperm which mediates sperm-oocyte fusion. The PH30 protein consists of two immunologically distinct alpha and beta subunits. Both subunits are made as larger precursors and then finally processed in the epididymis, where sperm become fertilization competent [Primakoff, P., et al., J. Cell Biology, 104, 141-149 (1987); Blobel, C. P., et al., J. Cell Biology, 111, 69-78 (1990)]. Monoclonal antibodies that recognize PH30 inhibit sperm-oocyte fusion in vitro, indicating its importance in fertilization [Primakoff, P., et al., J. Cell Biology, 104, 141-149 (1987)].

Guinea pig PH30 alpha and beta chains have been cloned by Blobel et al. Mature PH30 alpha chain consists of 289 amino acids and encodes a transmembrane domain as well as an integral fusion peptide (82-102) that is similar to a potential fusion peptide of E2 glycoprotein of rubella virus. Guinea pig PH30 beta chain has an open reading frame of 353 amino acids and also encodes a transmembrane domain [Blobel, C. P., et al., Nature, 356, 248-251 (1992)]. The predicted amino acid sequence of the PH30 beta chain protein contains significant homology to a class of proteins called disintegrins found in snake venom. These proteins are known to bind to a family of proteins called integrins and prevent their normal functioning in cell adhesion (a well studied example is platelet aggregation). The N-terminal ninety amino acid integrin-binding disintegrin domain of PH30 beta has been postulated to mediate the binding of PH30 to its putative integrin receptor on oocytes. The cloning and sequence determination of the mouse and human PH30 beta chain genes would permit novel approaches to the control of sperm-egg binding and fusion. These approaches include, but are not limited to, eliciting an immune response directed at all or part of the PH30 beta chain protein and using the PH30 beta chain protein as part of a screen to identify small molecules that alter sperm-egg interactions.

Mammalian fertilization is, in most cases, species specific. Thus, the identification and isolation of sperm surface proteins essential for fertilization in species other than guinea pig would be useful for providing effective long lasting contraception in those species. Thus far, the lack of biochemical identification, isolation and cloning of candidate adhesion proteins of sperm has hindered scientists in developing effective contraceptives for humans as well as other mammalian species.

SUMMARY OF THE INVENTION

The instant invention relates to a sperm protein in substantially pure form selected from a human PH30 beta chain protein, a mouse PH30 beta chain protein or an amino acid sequence substantially homologous to either the human or mouse PH30 beta chain protein.

In one embodiment of the invention is the sperm protein having an integrin binding sequence which is not TDE.

In one class is the sperm protein wherein the integrin binding sequence is selected from FEE or QDE.

In a subclass is the sperm protein which is the human PH30 beta chain protein.

Illustrative of this subclass is the sperm protein having an integrin binding sequence that is FEE.

Further illustrating the invention is a DNA sequence which encodes the sperm protein or a portion of the sperm protein sufficient to constitute at least one epitope.

An illustration is the DNA sequence wherein the epitope is on the native protein.

Exemplifying the invention is the DNA sequence which encodes all or a portion of human PH30 beta chain protein.

An example of the invention is the DNA sequence, wherein the DNA encoding all or a portion of the human PH30 beta protein is characterized by the ability to hybridize, under standard conditions, to the DNA sequence shown in SEQ ID NO: 1.

More particularly illustrating the invention is a contraceptive composition comprising a therapeutically effective amount of the protein, or a polypeptide having the substantially same amino acid sequence as a segment of the protein provided that the polypeptide is sufficient to constitute at least one epitope, and a pharmaceutically acceptable carrier.

Another illustration is the contraceptive composition wherein the epitope is on the native protein.

Further exemplifying the invention is the contraceptive composition, wherein the protein is the human PH30 beta chain protein.

More specifically illustrating the invention is the contraceptive composition, wherein the protein is produced by expressing the gene encoding an immunogenic epitope of the sperm protein in a recombinant DNA expression vector.

Specifically exemplifying the invention is a vector comprising an inserted DNA sequence encoding for the protein.

A further illustration of the invention is the vector, wherein the inserted DNA sequence is characterized by the ability to hybridize, under standard conditions, to a DNA sequence selected from the DNA sequences of SEQ ID NO: 1 or SEQ ID NO: 3.

Another example of the invention is a host that is compatible with and contains the vector.

More specifically exemplifying the invention is a method of producing a human or mouse PH30 beta chain sperm protein, comprising the steps of culturing cells containing PH30 beta chain DNA and recovering the sperm protein from the cell culture.

A further example is the method wherein the DNA encoding all or a portion of the PH30 beta chain protein is characterized by the ability to hybridize, under standard conditions, to a DNA sequence selected from the DNA sequences of SEQ ID NO: 1 or SEQ ID NO: 3.

A more specific illustration is a method of contraception in a human or mouse subject in need thereof, comprising administering to the subject an amount of the sperm protein which is effective for the stimulation of antibodies which bind to the sperm protein in vivo, thereby preventing or substantially reducing the rate of sperm-egg fusion.

Further illustrating the invention is the method wherein the sperm protein has an integrin binding sequence which is not TDE.

Another illustration is the PH30 beta chain protein made by the process described.

Another example is a DNA sequence as shown in Seq. ID No. 1 encoding human PH30 beta chain protein.

Still further illustrating the invention is a purified and isolated DNA sequence consisting essentially of a DNA sequence encoding a polypeptide having an amino acid sequence sufficiently duplicative of that of human or mouse PH30 beta to allow the possession of the biological property of initiating sperm-egg binding or promoting sperm-egg fusion. This biological activity can be determined using the in vitro sperm-oocyte binding/fusion assays [Primakoff, P., et al., J. Cell. Biol., 104: 141-149 (1987)].

More particularly exemplifying the invention is the DNA sequence wherein the amino acid sequence contains an integrin binding sequence which is not TDE.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A, 1B, 1C, 1D, 1E, and 1G comprise a diagram representing the human PH30 beta cDNA gene sequence encoding the human PH-30 beta protein, and the deduced amino acid sequence of the human PH-30 beta protein present in three letter code. The sequence disclosure of FIGS. 1A through 1G is represented as SEQ ID NO: 1 and 2.

FIGS. 2A, 2B, 2C, 2D and 2E comprise a diagram representing the mouse PH30 beta cDNA gene sequence, and the deduced amino acid sequence of the mouse PH-30 beta protein present in three letter code. The sequence disclosure of FIGS. 2A through 2E is represented as SEQ ID NO: 3 and 4.

FIGS. 3A, 3B, 3C, 3D, 3E, 3G, 3H, 3I, 3J and 3K represent a restriction map of the human PH30 beta cDNA sequence.

FIGS. 4A, 4B, 4C, 4D, 4E, 4G and 4H represent a restriction map of the mouse PH30 beta cDNA sequence.

DETAILED DESCRIPTION OF THE INVENTION

The subject invention relates to sperm surface proteins which are essential for fertilization, or portions thereof, and their use in contraceptive methods. A sperm surface protein is essential for fertilization if, for example, a monoclonal antibody to the protein or a polyclonal antibody raised against the purified protein, when bound to sperm, inhibits in vitro or in vivo fertilization or any step of in vitro fertilization. The process of fertilization is defined as the binding or fusion of two gametes (sperm and egg) followed by the fusion of their nuclei to form the genome of a new organism. The surface protein can be located in the plasma membrane of sperm and/or the inner acrosomal membrane. It can be a protein or glycoprotein. The isolated surface protein used for immunization can comprise the entire surface protein or some portion of the protein (external to the cell) which is immunogenic. Two such sperm surface proteins are the mouse and human PH30 beta chain sperm surface proteins. The PH30 beta genes encode proteins which are present on the surface of sperm cells and are essential for fertilization.

As used herein, a protein or peptide is "substantially pure" when that protein or peptide has been purified to the extent that it is essentially free of other molecules with which it is associated in nature. The term "substantially pure" is used relative to proteins or peptides with which the peptides of the instant invention are associated in nature, and is not intended to exclude compositions in which the peptide of the invention is admixed with nonproteinaceous pharmaceutical carriers or vehicles.

As used herein, an amino acid sequence substantially homologous to a referent PH-30 beta protein will have at least 70% sequence homology, preferably 80%, and most preferably 90% sequence homology with the amino acid sequence of a referent PH-30 beta protein or a peptide thereof. For example, an amino acid sequence is substantially homologous to mouse PH-30 beta protein if, when aligned with mouse PH30 beta protein, at least 70% of its amino acid residues are the same. In addition, it is preferable that the substantially homologous amino acid sequence contains the integrin binding sequence.
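The 70%, 80% and 90% thresholds above can be illustrated with a short calculation. The following is a minimal sketch, assuming the two sequences have already been aligned to equal length (gaps written as "-") and reading the definition literally, so that the percentage is taken over the residues of the candidate sequence. The function names and the ten-residue toy sequences are hypothetical and are not taken from SEQ ID NO: 2 or SEQ ID NO: 4.

```python
def percent_identity(referent: str, candidate: str) -> float:
    """Percent of candidate residues identical to the aligned referent residue.

    Both strings must be pre-aligned to the same length, with '-' for gaps.
    """
    if len(referent) != len(candidate):
        raise ValueError("aligned sequences must have equal length")
    candidate_residues = sum(1 for c in candidate if c != "-")
    if candidate_residues == 0:
        raise ValueError("candidate contains no residues")
    identical = sum(
        1 for r, c in zip(referent, candidate) if c != "-" and r == c
    )
    return 100.0 * identical / candidate_residues


def homology_class(identity: float) -> str:
    """Map a percent identity onto the tiers named in the definition."""
    if identity >= 90.0:
        return "most preferred"
    if identity >= 80.0:
        return "preferred"
    if identity >= 70.0:
        return "substantially homologous"
    return "not substantially homologous"


# Hypothetical toy sequences, for illustration only.
referent = "ACDEFGHIKL"
print(homology_class(percent_identity(referent, "ACDEFGHIKV")))
print(homology_class(percent_identity(referent, "ACDQFGHLKV")))
```

The sketch counts only exact residue matches; it does not check for the presence of the integrin binding sequence, which the definition treats as a separate preference.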

As used herein, a DNA sequence substantially homologous to a referent PH-30 beta protein will have at least 70%, preferably 80%, and most preferably 90% sequence homology with the DNA sequence of a referent PH-30 beta. Moreover, a DNA sequence substantially homologous to a referent PH-30 beta protein is characterized by the ability to hybridize to the DNA sequence of a referent PH30 beta under standard conditions. Standard hybridization conditions are described in Maniatis, T., et al. (1989) Molecular Cloning, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.

An "expression vector" or "vector," as used herein, refers to a plasmid, bacteriophage, virus, or other molecule into which a gene of interest may be cloned, such that the appropriate signals for expression of that gene are present on that vector.

The term "epitope," as used herein, refers to the minimum amount of PH30 beta sequence capable of producing an efficacious, i.e., contraceptive, immune response.

The term "therapeutically effective amount," as used herein, means that amount of a drug or pharmaceutical agent that will elicit the biological or medical response that is being sought by a researcher or clinician.

Production and Purification of Immunogen

A preferred method for producing sperm surface proteins for use as a contraceptive immunogen is by recombinant DNA technology. To produce the protein using this technology it is necessary to isolate and clone DNA encoding the protein, or an immunogenic portion thereof. Those skilled in the art are familiar with a variety of approaches which can be used in an effort to clone a gene of interest. However, having nothing more than the isolated protein of interest, success in such an effort cannot be predicted with a reasonable degree of certainty.

In the Examples which follow, Applicants describe the cloning and characterization of the mouse and human PH30 beta chain genes. The mouse and human PH30 beta chain genes were isolated using a cDNA encoding the guinea pig PH30 beta chain gene. The instant invention provides specific sequence information to permit targeted intervention in controlling fertility through anti-PH30 directed immune responses, inhibition of sperm-egg binding, and triggering of post-binding signaling and effective events. These sequences permit the generation of reagents for the isolation of oocyte proteins involved in sperm-egg interaction.

The information presented in the Examples enables one skilled in the art to isolate and clone the mouse or human PH30 beta chain gene. For example, a cDNA library is prepared from testis or spermatogenic cells isolated from the mammal of interest (e.g., mouse, human). Such a cDNA library is then screened using, for example, labeled guinea pig PH30 DNA probes. DNA encoding all or a portion of human or mouse PH30 is characterized by the ability to hybridize to such a probe sequence under hybridization conditions such as those described in Example 1. Methods of labeling and screening by hybridization are well known in the art. Positive clones are analyzed, and a full length cDNA is constructed by conventional methods.

The cloned gene, or portions thereof which encode an immunogenic region of the PH30 protein, can be expressed by inserting the coding region into an expression vector to produce an expression construct. Many such expression vectors are known to those skilled in the art. These vectors contain a promoter for the gene of interest as well as additional transcriptional and translational signals. Expression vectors for both eukaryotic host cells and prokaryotic host cells are widely available. The DNA expression construct is used to transform an appropriate host cell.

Eukaryotic, in particular mammalian, host cells are often utilized for the expression of eukaryotic proteins. It has been found, for example, that eukaryotic proteins may exhibit folding problems when expressed in prokaryotic cells. In addition, production of authentic, biologically active eukaryotic proteins from cloned DNA sometimes requires post-translational modification such as disulfide bond formation, glycosylation, phosphorylation or specific proteolytic cleavage processes that are not performed in bacterial cells. This is especially true with membrane proteins. The sperm surface protein is produced using the transcriptional and translational components of the host cell. After an appropriate growth and expression period, the host cell culture is lysed and the sperm surface protein is purified from the lysate. Lysis buffers typically include non-ionic detergent, protease inhibitors, etc.

From the solubilized cell extract, the sperm surface protein can be purified and isolated by physical and biochemical methods such as ultracentrifugation, column chromatography, high performance liquid chromatography, electrophoresis, etc. Alternatively, the sperm surface protein can be isolated by affinity chromatography using monoclonal or polyclonal antibodies [see Primakoff et al., Biol. of Reprod., 38, 921-934 (1988)]. Such methods for purifying proteins are well known to those skilled in the art.

As mentioned above, antigenic portions or epitopes of the sperm surface protein are useful as immunogen, in addition to the full length protein. Antigenic fragments can be produced, for example, by proteolytic digestion of the full length protein, followed by isolation of the desired fragment. Alternatively, chemical synthesis can be used to generate the desired fragment starting with monomer amino acid residues.

With respect to the PH30 protein, certain antigenic domains are preferred candidates for use in a contraceptive vaccine. As is discussed in greater detail in the Exemplification section which follows, the PH30 β subunit contains a domain which is highly conserved when compared to a class of proteins known as disintegrins. A peptide (or portion thereof) which is identical or substantially identical to this domain is preferred for use in the contraceptive methods of this invention. Substantially identical, as used in the preceding sentence, means that at least 70% of the amino acid sequence of the peptide is identical to the corresponding portion of the PH30 β disintegrin domain.

Disintegrins are found in snake venom, for example, and are known to bind to a class of platelet surface proteins known as integrins. The binding of disintegrins to integrins has been shown to inhibit blood clotting. By analogy, peptides corresponding to the PH30 β disintegrin domain are predicted to be active in sperm-egg binding and fusion.

Contraceptive Vaccine

Once the sperm surface protein has been produced and purified, a vaccine can be produced by combining the sperm surface protein or portion thereof with a suitable carrier for administration to a subject for immunization. For successful vaccine development it is necessary that the immunogen exhibit tissue specificity, that is, it is expressed on the target tissue only and must be essential for the process of reproduction. It is known that the PH30 protein, which is expressed only on sperm, is involved in sperm egg binding and antibodies that bind to PH30 inhibit that interaction.

The cloning and characterization of human PH30 beta permits novel approaches for using PH30 as a target to control human fertility. PH30 beta protein or peptides can be used directly as an antigen to elicit an immune response directed to the whole or a relevant part of the PH30 beta chain protein. Testing of these approaches requires availability of sufficient quantities of PH30 beta protein. The cloning and sequencing of the mouse and human PH30 beta chain provides information necessary to recombinantly express all or part of the PH30 beta protein. These expressed proteins are used with or without adjuvant to immunize women or female mice. The elicited humoral immune responses are monitored by assays that use PH30 beta as antigen. Secreted antibodies in the female reproductive system will bind to the sperm head and disrupt fertilization. The availability of the recombinant mouse PH30 beta protein permits establishment of an animal model system for testing efficacy, reversibility and safety of specific methods of controlling fertility based on PH30.

A vaccine can contain one or more sperm surface proteins. Sperm surface proteins of the present invention can be combined with adjuvants which contain non-specific stimulators of the immune system. Proper use of adjuvants can induce a strong antibody response to foreign antigens (i.e., sperm surface proteins). The action of adjuvants is not fully understood, but most adjuvants incorporate two components. One is a substance designed to form a deposit which protects the antigen from catabolism. Two methods of forming a deposit are to use mineral oils or aluminum hydroxide precipitates. With mineral oils, such as Freund's adjuvant, the immunogen is prepared in a water-in-oil emulsion. For aluminum hydroxide, the immunogen is either adsorbed to preformed precipitants or is trapped during precipitation.

The second component required for an effective adjuvant is a substance that will stimulate the immune system nonspecifically. These substances stimulate the production of a large set of soluble peptide factors known as lymphokines. In turn, lymphokines stimulate the activity of antigen-processing cells directly and cause a local inflammatory reaction at the site of injection. A component of lipopolysaccharide known as lipid A is commonly used. Lipid A is available in a number of synthetic and natural forms that are much less toxic than lipopolysaccharides, but still retain most of the desirable adjuvant properties of the lipopolysaccharide molecules. Lipid A compounds are often delivered using liposomes. The two bacteria that are commonly used in adjuvants as non-specific stimulants are Bordetella pertussis and Mycobacterium tuberculosis. When used as whole bacteria, they must be heat-killed prior to use. The immunomodulatory mediators of B. pertussis include a lipopolysaccharide component and the pertussis toxin. The pertussis toxin has been purified and is available commercially. M. tuberculosis is commonly found in complete Freund's adjuvant. The most active component of M. tuberculosis has been localized to muramyl dipeptide, which is available in a number of forms.

Immunizations (Inoculation and Booster Shots)

The subject to be immunized can be any mammal which possesses a competent immune system. Examples of subject mammals include humans and domestic animals (e.g. dogs, cats, cows, horses, etc.), as well as animals intended for experimental or other purposes (e.g., mice, rats, rabbits, etc.).

Two different criteria are important to consider in determining the proper dose for the initial immunization: first, the optimum dose to achieve the strongest response, and second, the minimum dose likely to induce the production of useful polyclonal antibodies. Much of the injected material will be catabolized and cleared before reaching the appropriate target immune cell. The efficiency of this process will vary with host factors, the route of injection, the use of adjuvants, and the intrinsic nature of the surface protein injected. Thus, the effective dose delivered to the immune system may bear little relationship to the introduced dose, and consequently dose requirements must be determined empirically. These determinations can be readily made by one skilled in the art. Secondary injections and later boosts can be given with amounts similar to or less than the primary injection.

The route of injection is guided by three practical decisions: 1) what volume must be delivered; 2) what buffers and other components will be injected with the immunogen; and 3) how quickly the immunogen should be released into the lymphatics or circulation. For example, with rabbits, large volume injections normally are given at multiple subcutaneous sites. For mice, large volumes are only possible with intraperitoneal injections. If adjuvants or particulate matter are included in the injection, the immunogen should not be delivered intravenously. If a slow release of the inoculant is desired, the injections should be done either intramuscularly or intradermally. For immediate release, intravenous injections are used.

Primary antibody responses often are very weak, particularly for readily catabolized, soluble antigens. Hence, secondary or booster injections are required after the initial immunization. A delay is needed before reintroducing the protein into a primed subject. A minimum of 2 or 3 weeks is recommended, but greater intervals are possible. The antibody responses to secondary and subsequent injections are much stronger. Higher titers of antibody are reached, but more importantly, the nature and quantity of the antibodies present in serum change. These changes yield high-affinity antibodies. The intervals between secondary, tertiary and subsequent injections may also be varied, but usually need to be extended to allow the circulating level of antibody to drop enough to prevent rapid clearance of newly injected antigen.

Subsequent booster injections will be required to increase reduced circulating antibody for continued contraception. The actual intervals for these injections will differ from species to species. However, the intervals can be determined by one skilled in the art by monitoring serum levels of sperm surface protein antibodies.

In another embodiment, subjects can be administered alloantisera, or monoclonal antibodies, directed to a sperm surface protein to achieve contraception. The alloantiserum is raised in another individual of the same species, isolated from the serum of the individual and prepared in a suitable carrier for injection into the recipient subject. Those skilled in the art are familiar with methods for preparing and formulating monoclonal antibodies for administration.

There is convincing evidence that naturally occurring antibodies to sperm cause infertility in women [Bronson, R. A., et al., Fertility and Sterility, 42: 171-183 (1984)]. This infertility is better correlated with the antibody titers in cervical mucus than with those in the serum [Clark, G. N., Amer. J. Reprod. Immunol., 5: 179-181 (1984)]. Presence of anti-sperm antibodies in the cervical mucus of infertile women results in poor sperm penetration through the cervical mucus and agglutination of the sperm, thereby reducing the number of sperm available for fertilization. Thus, success of a contraceptive vaccine depends in particular on the generation of mucosal immune responses involving sustained titers of antisperm antibodies in the female reproductive tract.

Generally, local application of the antigen is an effective way to stimulate an antibody response by that mucosa [Mestecky, J., J. Clin. Immunol., 7: 265-276 (1987)]. However, local mucosal immunization is ineffective in the female reproductive tract due to the barrier function of the luminal epithelium and to rapid loss of antigen from the lumen of the reproductive tract. Stability and adhesiveness of the antigen on the mucosal surface are important for the induction of mucosal immune responses [de Aizpurua, H. J. and Russell-Jones, G. J., J. Exp. Med., 167: 440 (1988)]. Adhesive antigens are critical to successful mucosal immunization, not only because they are effective mucosal immunogens themselves, but also because they are carrier proteins for other antigens. Cholera toxin is a potent immunogen when given mucosally, but acts as an adjuvant when given in combination with other antigens [McKenzie, S. J. and Halsey, J. A., J. Immunol., 133: 1818 (1984)]. Effective immunization is also dependent on the stability of the antigen on a mucosal surface. Many antigens for use in mucosal vaccines are poorly immunogenic because they are unable to survive in the acidic and proteolytic conditions of the mucosal surface [O'Hagen, D. T., Curr. Opin. Infect. Dis., 3: 393 (1990)]. The DL-lactide-co-glycolide (DL-PLG) microsphere, microparticle carrier system is one of the most suitable systems for mucosal immunization. DL-PLG microspheres protect the antigen at the mucosal surface and are taken up by the mucosal lymphoid tissues, where they induce mucosal immunity [Eldridge, J. H., et al., Curr. Top. Microbiol. Immunol., 146: 59 (1989)]. Liposomes and inactivated micro-organisms also are used as microparticle carriers. Some parenteral adjuvants such as Avridine, a lipoidal amine, and muramyl dipeptide (MDP), the active component of mycobacteria in Freund's complete adjuvant, also have been shown to be active as oral mucosal adjuvants and enhance mucosal immunization [Anderson, A. O. and Reynolds, J. A., J. Reticuloendothel. Soc., 26(suppl): 667 (1979); Taubman, M. A., et al., Ann. NY Acad. Sci., 409: 637 (1983)]. Development of mucosal immune responses in the female reproductive tract is optimized by using various adjuvants and microparticle carriers, by immunizing at local or remote mucosal surfaces, or by a combination of parenteral and mucosal immunization.

Utility of PH30 beta in Identification of Small Molecules that will Disrupt Sperm-egg Interaction and Fertilization

The comparison of the protein sequences of both mouse and human PH30 beta chain genes shows significant homology to a class of proteins called disintegrins found in snake venoms. These proteins are known to bind a family of cell surface molecules called integrins and prevent their normal function in cell adhesion. On the basis of these homologies it is reasonable to conclude that the PH30 receptor on the oocyte is an integrin. Comparisons of the disintegrin domain sequences of guinea pig, mouse and human PH30 beta chain genes show significant differences in their putative ligand binding domain. In particular, the sequences in this region differ from those of other disintegrins and among the three species. The recombinant mouse and human PH30 beta proteins are used to make affinity resins to purify, identify and characterize mouse and human PH30 receptors. The recombinant PH30 beta proteins also are used to determine their relative affinity to other integrins expressed in other tissues and are used as ligands for cloning of the PH30 receptor.

Since the integrin recognition sequences in PH30 beta are species specific, the sequence information is necessary to identify small molecules that disrupt fertilization in a species specific manner. The recombinant mouse and human PH30 beta are used to set up screens to identify small molecules that either act as an antagonist to the PH30 receptor and disrupt PH30 binding, or act as an agonist and stimulate the PH30 receptor, inducing transmembrane signaling, egg cortical granule release and the zona reaction, thus making the egg impenetrable for fertilization.

The present invention is further illustrated in the following exemplification.

EXAMPLE 1

Isolation of DNA Encoding Mouse and Human PH30 beta

A. cDNA Library Plating

One million independent recombinant bacteriophage from both a human testis cDNA library in λgt11 (Clontech, Palo Alto, Calif.) and a mouse testis cDNA library in UNI-ZAP XR (Stratagene, La Jolla, Calif.) were plated. Plaque lifts were done in duplicate by placing a nitrocellulose filter on the plate for two minutes, and treating the filter with denaturing solution (0.5M NaOH, 1.5M NaCl), neutralization buffer (0.5M Tris pH 7.5, 1.5M NaCl) and 2× SSC (3M NaCl, 0.35M sodium citrate pH 7.0) for two minutes each. The filters were dried for thirty minutes at room temperature and then baked for two hours at 80° C. in a vacuum oven.

B. Generation of Probe:

A guinea pig PH30 beta cDNA was isolated by RT-PCR (reverse transcriptase-polymerase chain reaction) as a 1020 bp (base pairs) HindIII/BamHI fragment containing 94% of the coding sequence. This fragment was subcloned into pBluescript SK⁺ vector (Stratagene, La Jolla, Calif.) and verified by sequence analysis. A probe was made by nick translating the purified 1020 bp guinea pig PH30 beta fragment. The filters were probed at 42° C. for fifteen hours in hybridization solution (7 mM Tris pH 7.5, 40% formamide, 4× SSC, 0.8× Denhardt's, 20 μg/ml of salmon sperm DNA and 10% dextran sulfate) containing 10⁶ cpm (counts per minute)/ml of the labeled probe. The filters were washed twice at room temperature for fifteen minutes each with 2× SSC/0.2% SDS (sodium dodecyl sulfate), then twice at room temperature with 0.2× SSC/0.1% SDS, then once at 42° C. for 30 minutes with 0.1× SSC/0.1% SDS. The filters were exposed to XAR film (Eastman Kodak Co, Rochester, N.Y.) for 15 hours. The positive plaques were picked into 1 ml of SM (0.1M NaCl, 10 mM magnesium sulphate, 2% gelatin, 50 mM Tris pH 7.5) and screened again as described above. After four rounds of screening, purified plaques were obtained.

Purified plaques of the mouse testicular library were subcloned into pBluescript SK⁺ vector using the ExAssist helper phage and SOLR cells (Stratagene, La Jolla, Calif.). DNA from the purified plaques of the human testicular library was isolated using light PLG 2 tubes, following the manufacturer's (Clontech, Palo Alto, Calif.) directions. The DNA was then digested with the restriction enzyme EcoRI, ligated into pBluescript SK⁺ and used to transform competent E. coli strain HB101 cells.

C. DNA Sequencing and Analysis:

Cloned inserts were sequenced on both strands using the Sequenase kit (United States Biochemical, Cleveland, Ohio). Sequences were analyzed by searching the GenBank and EMBL DNA sequence databases using the FASTA program (University of Wisconsin, Genetics Computer Group), and sequence comparisons were done using the GAP program.
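The percent identity figures reported in the following sections come from pairwise global alignments. As a minimal sketch of that kind of comparison, the Python below implements a Needleman-Wunsch global alignment with a simple linear gap penalty and then counts identical positions. It is not the GAP program itself: the scoring values, the function names and the choice to compute identity over non-gap columns are assumptions made for illustration. The two test strings are short stretches of the mouse and human disintegrin domains taken from SEQ ID NO:4 and SEQ ID NO:2.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment with a linear gap penalty.

    The scoring values are illustrative assumptions, not the GAP defaults.
    Returns the two aligned strings, with '-' marking gaps.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + s,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        s = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + s:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aln_a, aln_b):
    """Identical residues as a percentage of aligned (non-gap) columns."""
    pairs = [(x, y) for x, y in zip(aln_a, aln_b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


mouse_fragment = "CRLAQDECDVTEYC"  # residues taken from SEQ ID NO:4
human_fragment = "CRPSFEECDLPEYC"  # residues taken from SEQ ID NO:2
aligned_mouse, aligned_human = global_align(mouse_fragment, human_fragment)
print(round(percent_identity(aligned_mouse, aligned_human), 1))
```

Identity values depend on the gap model and on the denominator used, so a figure from a sketch like this is only comparable to another figure computed the same way.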

D. Characterization of cDNA Clones:

Screening of the mouse testicular library with the 1020 bp guinea pig PH30 beta probe resulted in the isolation of a 1.7 kb (kilo base pair) cDNA clone. This cDNA clone contains a 1371 nucleotide open reading frame and a 329 nucleotide 3' untranslated region. When the mature parts of the guinea pig and mouse PH30 beta were compared, the mouse PH30 beta clone showed a maximum of 63% identity to guinea pig PH30 beta at the nucleotide level. The amino terminal 103 residues of the deduced 457 amino acid sequence represent the precursor region of the mouse PH30 beta that is cleaved off at sperm maturation. At the amino acid level the mature mouse and guinea pig PH30 betas were 54% identical, with all the cysteines lining up.

Screening of the human testicular cDNA library identified a 2.331 kb cDNA which contains an open reading frame of 1959 nucleotides and a 372 nucleotide 3' untranslated region. The human PH30 beta clone was 63% and 67% identical in its open reading frame to the mouse and guinea pig PH30 beta genes, respectively. Comparison of the derived 653 amino acid sequence with the mouse and guinea pig PH30 beta indicates that the amino terminal 299 amino acids represent the precursor and the carboxy terminal 354 amino acids represent the mature part of human PH30 beta. The amino acid sequence of the mature human PH30 beta was 54% homologous to the mature guinea pig and mouse PH30 beta proteins.
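The clone sizes quoted for the mouse and human cDNAs can be checked against each other with simple arithmetic: the open reading frame plus the 3' untranslated region gives the cDNA length, the open reading frame divided by three gives the deduced residue count, and subtracting the precursor residues gives the mature length. The sketch below does this for both clones. The function name and dictionary keys are hypothetical, and it assumes the stated open reading frame lengths do not include a stop codon.

```python
def summarize_clone(orf_nt, utr3_nt, precursor_aa):
    """Derive clone length and protein lengths from the figures in the text."""
    if orf_nt % 3 != 0:
        raise ValueError("open reading frame length is not a multiple of 3")
    total_aa = orf_nt // 3
    return {
        "cdna_nt": orf_nt + utr3_nt,          # ORF plus 3' untranslated region
        "total_aa": total_aa,                  # deduced amino acid residues
        "mature_aa": total_aa - precursor_aa,  # residues left after cleavage
    }


mouse = summarize_clone(orf_nt=1371, utr3_nt=329, precursor_aa=103)
human = summarize_clone(orf_nt=1959, utr3_nt=372, precursor_aa=299)

print(mouse)  # {'cdna_nt': 1700, 'total_aa': 457, 'mature_aa': 354}
print(human)  # {'cdna_nt': 2331, 'total_aa': 653, 'mature_aa': 354}
```

Both clones therefore give a mature region of 354 residues, consistent with the 1.7 kb and 2.331 kb clone sizes and the 457 and 653 residue deduced sequences.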

Protein sequence comparison of mouse and human PH30 beta to guinea pig PH30 beta and snake venom disintegrins indicated significant homology. This analysis revealed similar structural organization and indicated the presence of metalloprotease and disintegrin domains in these proteins.

Metalloprotease domains of mouse and human PH30 beta shared significant similarity with the metalloprotease domain of guinea pig PH30 beta but less similarity to the metalloprotease domain of guinea pig PH30 alpha or other disintegrins. The active site signature sequence of zinc-dependent metalloproteases is present in PH30 alpha and the snake venom disintegrins Jararhagin and Trigramin [Wolfsberg, T. G., et al., Proc. Natl. Acad. Sci. USA 90: 10783-10797 (1993)]. Similar to guinea pig PH30 beta, the mouse and human metalloprotease domains lack the active site signature sequence, and both were 80% identical to guinea pig PH30 beta and only 30% identical to guinea pig PH30 alpha in the metalloprotease active site sequence. Human and guinea pig PH30 beta metalloprotease domains were 60% identical.

Similar to guinea pig PH30 beta, the mouse and human PH30 beta also contain a disintegrin domain. The disintegrin domain in mouse PH30 beta contains 91 amino acids (residues 111-202) and in human, 93 amino acids (residues 299-392). Most disintegrins of snake venom contain a consensus integrin binding sequence, RGD. Another family of snake venom disintegrins, which are linked to a carboxyl terminal cysteine rich domain, lack the RGD tripeptide but contain a unique tripeptide and an adjacent cysteine. Guinea pig, mouse and human PH30 beta proteins also do not contain the RGD tripeptide and share more similarity with this latter family of disintegrins. These snake venom disintegrins and the disintegrin domains of guinea pig, mouse and human PH30 beta contain a negatively charged residue at the carboxyl end of the tripeptide sequence. The integrin binding sequence of guinea pig PH30 beta is TDE. One skilled in the art would have expected that the integrin binding site of PH30 beta of other mammalian species would also be TDE. However, after isolation of human and mouse PH30 beta, it was found that this was not the case. It was unexpectedly discovered that the critical sequence at the integrin binding site was not conserved. Comparisons of guinea pig, mouse and human PH30 beta disintegrin domains showed significant variation in their putative integrin binding sequences, although the carboxy terminal ends of these domains were identical. The putative integrin binding residues in PH30 beta were QDE in mouse and FEE in human. These differences in the integrin binding sequences between species were an unexpected and surprising finding.
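The tripeptides named above can be located in the sequence listing. The sketch below uses two short fragments transcribed from SEQ ID NO:4 (mouse) and SEQ ID NO:2 (human) in one-letter code and pulls out the three residues that precede the adjacent cysteine. The regular expression is a hypothetical pattern written to fit these two fragments only; it is not a published consensus, and no guinea pig fragment is included because that sequence is not part of this listing.

```python
import re

# Fragments transcribed from the sequence listing (one-letter code).
FRAGMENTS = {
    "mouse": "CRLAQDECDVTEYC",  # from SEQ ID NO:4
    "human": "CRPSFEECDLPEYC",  # from SEQ ID NO:2
}

# Hypothetical pattern: Cys-Arg, two variable residues, the tripeptide of
# interest, then the adjacent Cys and the shared ...EYC stretch.
MOTIF = re.compile(r"CR[A-Z]{2}([A-Z]{3})CD[A-Z]{2}EYC")


def binding_tripeptide(fragment):
    """Return the tripeptide preceding the adjacent cysteine, or None."""
    found = MOTIF.search(fragment)
    return found.group(1) if found else None


def ends_with_acidic_residue(tripeptide):
    """True when the carboxyl-end residue is Asp (D) or Glu (E)."""
    return tripeptide[-1] in "DE"


for species, fragment in FRAGMENTS.items():
    tripeptide = binding_tripeptide(fragment)
    print(species, tripeptide, ends_with_acidic_residue(tripeptide))
# mouse QDE True
# human FEE True
```

Both tripeptides end in glutamate, in line with the negatively charged residue described at the carboxyl end of the tripeptide, while the first two positions differ between the species.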

Both mouse and human PH30 beta contain an epidermal growth factor like repeat and a transmembrane domain that are 60% identical to similar regions of guinea pig PH30 beta.

EXAMPLE 2

Cloning of the 5' end of Mouse and Human PH30 Beta

The 5' ends of mouse and human PH30 beta were cloned using the Gibco BRL "5' RACE System for Rapid Amplification of cDNA Ends", following the manufacturer's protocols. Two oligonucleotides were synthesized for each template. Oligo 1 was an antisense primer; Oligo 2 was also an antisense primer, internal to Oligo 1, and contained the CAU sequences on the 5' end to facilitate cloning. Oligo 1 was annealed to mouse or human testis mRNA and a cDNA copy was made using SuperScript II Reverse Transcriptase. The mRNA template was degraded with RNase H. The single-stranded cDNA copy was purified with GlassMAX Spin columns and was then tailed on the 3' end with dCTP and terminal transferase. The tailed cDNA copy was then amplified using Oligo 2 and a supplied anchor primer that contains the 5' CAU cloning site. Amplification was carried out with Taq polymerase. The amplified product was then gel purified, treated with Uracil DNA Glycosylase, subcloned into the vector pAMP1 and then transformed into competent E. coli DH5 cells. Colonies carrying the subcloned fragment were identified and sequenced as described previously.

The complete mouse cDNA sequence and the deduced amino acid sequence of the mouse PH30 beta protein are shown in SEQ ID NO: 5 and SEQ ID NO: 6. The complete human cDNA sequence and the deduced amino acid sequence of the human PH30 beta protein are shown in SEQ ID NO: 7 and SEQ ID NO: 8.

At the nucleotide level, the complete human PH30 beta shares 68% identity with mouse and 68.6% identity with guinea pig PH30 beta, respectively. Mouse and guinea pig DNA sequences are 65.5% identical. The amino acid sequence of the human PH30 beta is 58.9% identical to mouse and 56.5% identical to guinea pig PH30 beta. At the amino acid level, the mouse and guinea pig PH30 beta are 55.2% identical.

EXAMPLE 3

Contraceptive Vaccination by the Administration of PH30 beta Protein

Female or male mice (about 7 weeks old at the time of first injection) receive two injections of PH30 beta in the amounts stated below. Recombinant or native PH30 beta, purified from a cell line or from sperm by mAb-affinity chromatography or biochemical methods, shows at least 90% purity (i.e., no more than 10% detectable contaminants) by silver staining of purified protein on SDS gels. Purity of each PH30 preparation used for immunization of females or males is verified by SDS polyacrylamide gel electrophoresis and silver staining. The affinity-purified PH30 beta, in 0.375 ml phosphate-buffered saline (PBS) containing 3 mM octylglucoside (OG), is emulsified with 0.375 ml complete Freund's adjuvant (CFA). Each animal receives 0.1 ml of the emulsion subcutaneously in the back and 0.05 ml intramuscularly in a rear leg. About 3 weeks later, the same amount of PH30 beta in PBS and 3 mM OG is emulsified with incomplete Freund's adjuvant (IFA) and is injected at the same sites in each animal. Control females and males receive the same injections on the same schedule, containing PBS, 3 mM OG and CFA or IFA but lacking PH30 beta. To allow the injected females to mate, about 6 weeks after the initial injection they are housed with males for 10 days. Each cage contains one male (13 weeks old), one PH30 beta immunized female, and from 2-4 control injected females. Beginning 24 hours after the grouping, females are checked visually each day for vaginal plugs. Two weeks after the initiation of mating, the females are moved into individual cages. After three weeks, the pregnant females having litters and their progeny are counted. To allow the injected males to mate, about six weeks after the initial injection each injected male is housed with two females (10-13 weeks old) for 10 days. The females and males are then separated, and after an additional 3 weeks pups are counted.

EXAMPLE 4

Use of PH30 Disintegrin Peptides as Inhibitor of Sperm Fusion to Egg Plasma Membrane

Peptides from the PH30 β disintegrin domain are tested for inhibition of sperm binding to the egg plasma membrane.

The fusion inhibition assay is carried out as follows. Young female mice (8-10 weeks of age) are injected intraperitoneally (IP) with 5 units of pregnant mare's serum (PMS) in 0.9% NaCl. 48 hours later, the mice are injected IP with 5 units of hCG (human chorionic gonadotrophin) in 0.9% NaCl to trigger superovulation. 14-16 hours after hCG injection, the ovulated oocytes are collected and treated with hyaluronidase to remove cumulus cells. The zona pellucida is removed with a mixture of proteases. The zona pellucida free eggs are incubated in culture media with peptide at a specified concentration for 30 minutes [Hogan, B., et al., Manipulating The Mouse Embryo, 91-101 (1986)]. Sperm collected from the epididymis of male mice are capacitated by incubation and acrosome reacted as described by Fleming and Yanagimachi [Gamete Res. 4, 253-273 (1981)], then added to the eggs and incubated for 15 minutes. The eggs are then transferred to a sperm free culture medium and incubated for an additional 1 hour and 45 minutes. The eggs are then fixed and stained as described by Primakoff et al. [J. Cell. Biol. 104, 141 (1987)]. The total number of swollen sperm heads is then counted. Swollen sperm heads are an indication that the sperm and egg have fused.

On the basis of these observations, several indices are calculated. The fertilization index (F.I.) is determined by dividing the total number of swollen heads by the total number of eggs. The fertilization rate (F.R.) is the percentage of eggs fertilized. The percent inhibition is determined by dividing the fertilization index of the experimental peptide by the fertilization index of the control peptide.
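As a minimal sketch of how these indices could be computed from per-egg counts of swollen sperm heads, the Python below follows the definitions given above. The counts are made-up illustration data, not results. The text defines percent inhibition as the ratio of the two fertilization indices; the sketch reports that ratio and, as a labeled assumption, also the conventional form 100 × (1 − ratio), since a plain ratio is not itself a percentage of inhibition.

```python
def fertilization_index(heads_per_egg):
    """F.I.: total swollen sperm heads divided by total eggs."""
    return sum(heads_per_egg) / len(heads_per_egg)


def fertilization_rate(heads_per_egg):
    """F.R.: percentage of eggs with at least one swollen sperm head."""
    fertilized = sum(1 for heads in heads_per_egg if heads > 0)
    return 100.0 * fertilized / len(heads_per_egg)


def index_ratio(fi_experimental, fi_control):
    """Ratio of experimental to control F.I., as described in the text."""
    return fi_experimental / fi_control


def percent_inhibition(fi_experimental, fi_control):
    """Assumed conventional form: 100 * (1 - experimental/control)."""
    return 100.0 * (1.0 - index_ratio(fi_experimental, fi_control))


# Hypothetical counts of swollen sperm heads, one entry per egg.
control_peptide = [2, 1, 3, 1, 0, 2, 1, 1, 2, 1]
test_peptide = [0, 0, 1, 0, 0, 1, 0, 0, 0, 0]

fi_control = fertilization_index(control_peptide)
fi_test = fertilization_index(test_peptide)
print(fi_control, fertilization_rate(control_peptide))
print(fi_test, fertilization_rate(test_peptide))
print(round(percent_inhibition(fi_test, fi_control), 1))
```

With these illustrative counts the control gives F.I. 1.4 and F.R. 90%, the test peptide gives F.I. 0.2 and F.R. 20%, and the assumed percent inhibition is about 85.7%.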

The PH30 β disintegrin domain represents an epitope which is critical in sperm-egg fusion. Antibodies which bind specifically to this epitope block sperm/egg fusion.

EXAMPLE 5

Use of PH30 beta to Identify Small Molecules that Will Disrupt Sperm-Egg Interaction and Fertilization

A. Identification of PH30 beta receptor antagonists:

Identification of compounds that specifically interfere with the binding of PH30 to its receptor on the egg has been limited by the unavailability of sufficient quantities of PH30 protein and of normal human eggs. The availability of rPH30 beta facilitates the identification and cloning of PH30 beta receptor integrin cDNAs. These PH30 beta receptor cDNAs are used to generate recombinant PH30 beta receptors. This alternative source of PH30 beta receptors facilitates identification of substances that affect the binding of PH30 beta to its receptors.

Using conventional methods, Chinese Hamster Ovary cells are transfected with cDNAs encoding the PH30 beta receptor to produce a stably transformed cell line which expresses human PH30 beta receptor integrin in large quantities. Such a transformed cell line provides a consistent source of recombinant PH30 beta receptors and is useful in the characterization of the binding of PH30 beta to its receptor and for establishing assays to screen for compounds that inhibit PH30 binding to its receptor.

Selectivity of the compounds for the PH30 beta receptor is examined by using cell lines that express other integrin receptors containing the same beta subunit and a closely related alpha chain. Compounds that specifically inhibit the PH30 beta/receptor interaction are tested further in biological assays, such as the sperm-egg fusion inhibition assay and the egg cortical granule release assay, to determine their efficacy in inhibiting fertilization.

B. Protocol for PH30 beta antagonist screen:

Cells expressing PH30 beta receptor are treated with extraction buffer (50 mM Tris pH 7.6, 100 mM n-Octyl β-D-Glucopyranoside, 150 mM NaCl, 1 mM MgCl₂ and 1 mM CaCl₂), and the soluble material is separated by centrifugation and stored frozen at -80° C. In an assay tube, 15 μl of water, 80 μl of assay buffer (125 mM Tris pH 7.6, 187.5 mM NaCl, 1.25 mM CaCl₂, 1.25 mM MgCl₂ and 1.25% BSA) and 5 μl of sample compound or control (40 μM of cold PH30 beta) are added and mixed with 50 μl of ¹²⁵I-PH30 beta (final concentration 40 pM) and 50 μl of cell extract (final protein concentration 250 μg/ml). The tubes are incubated at room temperature for 1 hour. Following incubation, the samples are harvested using a Tomtec Mach II 6×16 cell harvester and printed filtermat cat. # 1205-404. Filters are dried and counted in an LKB/Wallac Beta Plate counter. Calculations and Interpretations: ##EQU1## When % inhibition >60 and the inhibition is dose related, the sample will be considered active.
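The volumes listed for the assay tube add up to 200 μl, which fixes the dilution of every component. The sketch below works through that arithmetic: the final concentration of each assay buffer component, and the stock concentrations of labeled PH30 beta and cell extract needed to reach the stated final values. Function and variable names are hypothetical, and the calculation counts only what the assay buffer contributes; salts and detergent carried in with the cell extract are ignored.

```python
# Volumes added to each assay tube, in microliters (from the protocol).
VOLUMES_UL = {
    "water": 15.0,
    "assay_buffer": 80.0,
    "sample_or_control": 5.0,
    "labeled_ph30_beta": 50.0,
    "cell_extract": 50.0,
}

# Assay buffer composition as prepared (stock concentrations).
ASSAY_BUFFER_STOCK = {
    "Tris (mM)": 125.0,
    "NaCl (mM)": 187.5,
    "CaCl2 (mM)": 1.25,
    "MgCl2 (mM)": 1.25,
    "BSA (%)": 1.25,
}


def final_concentration(stock, volume_added, total_volume):
    """C_final = C_stock * V_added / V_total."""
    return stock * volume_added / total_volume


def required_stock(final, volume_added, total_volume):
    """C_stock = C_final * V_total / V_added."""
    return final * total_volume / volume_added


total_ul = sum(VOLUMES_UL.values())

buffer_final = {
    name: final_concentration(stock, VOLUMES_UL["assay_buffer"], total_ul)
    for name, stock in ASSAY_BUFFER_STOCK.items()
}

# Stocks needed to hit the stated final values of 40 pM and 250 ug/ml.
tracer_stock_pm = required_stock(40.0, VOLUMES_UL["labeled_ph30_beta"], total_ul)
extract_stock_ug_ml = required_stock(250.0, VOLUMES_UL["cell_extract"], total_ul)

print(total_ul)
print(buffer_final)
print(tracer_stock_pm, extract_stock_ug_ml)
```

Under these assumptions the assay buffer contributes 50 mM Tris, 75 mM NaCl, 0.5 mM CaCl₂, 0.5 mM MgCl₂ and 0.5% BSA to the 200 μl reaction, and the labeled PH30 beta and cell extract would be added from stocks of 160 pM and 1000 μg/ml respectively.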

C. Sperm-Oocyte fusion assay:

Young female mice (approximately 8-10 weeks of age) are injected intraperitoneally (IP) with 5 units of pregnant mare's serum (PMS) in 0.9% NaCl. 48 hours later, the mice are injected IP with 5 units of hCG (human chorionic gonadotrophin) in 0.9% NaCl to trigger superovulation. 14-16 hours after hCG injection, the ovulated oocytes are collected and treated with hyaluronidase to remove cumulus cells. The zona pellucida is removed by treating eggs briefly with 0.1 mg/ml of chymotrypsin. Oocytes are washed with Hepes buffered culture medium and are loaded with the fluorescent stain 4',6-diamidino-2-phenylindole dihydrochloride (DAPI) by incubating at 37° C. for 30 minutes. Oocytes are then washed with medium and incubated with rPH30 beta or inhibitor compound for 30 minutes, followed by another 30 minute incubation with 1×10⁴ sperm that have been previously capacitated by incubating with calcium ionophore. After incubation, the oocytes are washed, mounted and examined by light microscopy and scored for the presence of fluorescent swollen sperm heads with associated tails in the cytoplasm. ##EQU2## In the absence of any inhibitor, >90% of oocytes are fertilized. When the sperm-oocyte fusion is inhibited >60% and the inhibition is dose related, the compound will be considered active.
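The activity criterion stated above (more than 60% inhibition, and inhibition that is dose related) can be expressed as a small decision rule. The scoring equation itself is not reproduced in this text, so the sketch below is an assumption throughout: it takes percent inhibition as 100 × (1 − treated/control) on the percentage of oocytes fertilized, and it reads "dose related" as inhibition that never falls as the dose rises and is higher at the top dose than at the lowest. All names and numbers are hypothetical.

```python
def percent_fertilized(n_fertilized, n_scored):
    """Percentage of scored oocytes showing fused (swollen) sperm heads."""
    if n_scored <= 0:
        raise ValueError("at least one oocyte must be scored")
    return 100.0 * n_fertilized / n_scored


def percent_inhibition(treated_pct, control_pct):
    """Assumed form: reduction in fertilization relative to the control."""
    if control_pct <= 0:
        raise ValueError("control fertilization must be positive")
    return 100.0 * (1.0 - treated_pct / control_pct)


def is_dose_related(dose_response):
    """dose_response: list of (dose, percent inhibition) pairs.

    Assumed reading: inhibition is non-decreasing with dose and is higher
    at the highest dose than at the lowest.
    """
    ordered = [inhibition for _, inhibition in sorted(dose_response)]
    non_decreasing = all(a <= b for a, b in zip(ordered, ordered[1:]))
    return len(ordered) >= 2 and non_decreasing and ordered[-1] > ordered[0]


def is_active(dose_response, threshold=60.0):
    """Active when inhibition exceeds the threshold and is dose related."""
    peak = max(inhibition for _, inhibition in dose_response)
    return peak > threshold and is_dose_related(dose_response)


# Hypothetical example: 50 oocytes scored per condition.
control_pct = percent_fertilized(46, 50)  # no inhibitor
treated = {1: 40, 10: 25, 100: 9}         # dose (arbitrary units) -> fertilized
curve = [
    (dose, percent_inhibition(percent_fertilized(n, 50), control_pct))
    for dose, n in treated.items()
]
print(control_pct, is_active(curve))
```

In this made-up example the control is 92% fertilized, inhibition rises with dose to roughly 80% at the top dose, and the compound would be called active under the assumed rule.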

While the invention has been described and illustrated with reference to certain preferred embodiments thereof, those skilled in the art will appreciate that various changes, modifications and substitutions can be made therein without departing from the spirit and scope of the invention. It is intended, therefore, that the invention be limited only by the scope of the claims which follow and that such claims be interpreted as broadly as is reasonable.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 8     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2373 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     - GGCCAAGATT TTCAGAATTT CTGCCACTAC CAAGGGTATA TTGAAGGTTA TC - #CAAAATCT       60     - GTGGTGATGG TTAGCACATG TACTGGACTC AGGGGCGTAC TACAGTTTGA AA - #ATGTTAGT      120     - TATGGAATAG AACCCCTGGA GTCTTCAGTT GGCTTTGAAC ATGTAATTTA CC - #AAGTAAAA      180     - CATAAGAAAG CAGATGTTTC CTTATATAAT GAGAAGGATA TTGAATCAAG AG - #ATCTGTCC      240     - TTTAAATTAC AAAGCGCAGA GCCACAGCAA GATTTTGCAA AGTATATAGA AA - #TGCATGTT      300     - ATAGTTGAAA AACAATTGTA TAATCATATG GGGTCTGATA CAACTGTTGT CG - #CTCAAAAA      360     - GTTTTCCAGT TGATTGGATT GACGAATGCT ATTTTTGTTT CATTTAATAT TA - #CAATTATT      420     - CTGTCTTCAT TGGAGCTTTG GATAGATGAA AATAAAATTG CAACCACTGG AG - #AAGCTAAT      480     - GAGTTATTAC ACACATTTTT AAGATGGAAA ACATCTTATC TTGTTTTACG TC - #CTCATGAT      540     - GTGGCATTTT TACTTGTTTA CAGAGAAAAG TCAAATTATG TTGGTGCAAC CT - #TTCAAGGG      600     - AAGATGTGTG ATGCAAACTA TGCAGGAGGT GTTGTTCTGC ACCCCAGAAC CA - #TAAGTCTG      660     - GAATCACTTG CAGTTATTTT AGCTCAATTA TTGAGCCTTA GTATGGGGAT CA - #CTTATGAT      720     - GACATTAACA AATGCCAGTG CTCAGGAGCT GTCTGCATTA TGAATCCAGA AG - #CAATTCAT      780     - TTCAGTGGTG TGAAGATCTT TAGTAACTGC AGCTTCGAAG ACTTTGCACA TT - #TTATTTCA      840     - AAGCAGAAGT CCCAGTGTCT TCACAATCAG CCTCGCTTAG ATCCTTTTTT CA - #AACAGCAA      900     - GCAGTGTGTG GTAATGCAAA GCTGGAAGCA GGAGAGGAGT GTGACTGTGG GA - #CTGAACAG      960     - GATTGTGCCC TTATTGGAGA AACATGCTGT GATATTGCCA CATGTAGATT TA - #AAGCCGGT     1020     - TCAAACTGTG CTGAAGGACC ATGCTGCGAA AACTGTCTAT 
TTATGTCAAA AG - #AAAGAATG     1080     - TGTAGGCCTT CCTTTGAAGA ATGCGACCTC CCTGAATATT GCAATGGATC AT - #CTGCATCA     1140     - TGCCCAGAAA ACCACTATGT TCAGACTGGG CATCCGTGTG GACTGAATCA AT - #GGATCTGT     1200     - ATAGATGGAG TTTGTATGAG TGGGGATAAA CAATGTACAG ACACATTTGG CA - #AAGAAGTA     1260     - GAGTTTGGCC CTTCAGAATG TTATTCTCAC CTTAATTCAA AGACTGATGT AT - #CTGGAAAC     1320     - TGTGGTATAA GTGATTCAGG ATACACACAG TGTGAAGCTG ACAATCTGCA GT - #GCGGAAAA     1380     - TTAATATGTA AATATGTAGG TAAATTTTTA TTACAAATTC CAAGAGCCAC TA - #TTATTTAT     1440     - GCCAACATAA GTGGACATCT CTGCATTGCT GTGGAATTTG CCAGTGATCA TG - #CAGACAGC     1500     - CAAAAGATGT GGATAAAAGA TGGAACTTCT TGTGGTTCAA ATAAGGTTTG CA - #GGAATCAA     1560     - AGATGTGTGA GTTCTTCATA CTTGGGTTAT GATTGTACTA CTGACAAATG CA - #ATGATAGA     1620     - GGTGTATGCA ATAACAAAAA GCACTGTCAC TGTAGTGCTT CATATTTACC TC - #CAGATTGC     1680     - TCAGTTCAAT CAGATCTATG GCCTGGTGGG AGTATTGACA GTGGCAATTT TC - #CACCTGTA     1740     - GCTATACCAG CCAGACTCCC TGAAAGGCGC TACATTGAGA ACATTTACCA TT - #CCAAACCA     1800     - ATGAGATGGC CATTTTTCTT ATTCATTCCT TTCTTTATTA TTTTCTGTGT AC - #TGATTGCT     1860     - ATAATGGTGA AAGTTAATTT CCAAAGGAAA AAATGGAGAA CTGAGGACTA TT - #CAAGCGAT     1920     - GAGCAACCTG AAAGTGAGAG TGAACCTAAA GGGTAGTCTG GACAACAGAG AT - #GCCATGAT     1980     - ATCACTTCTT CTAGAGTAAT TATCTGTGAT GGATGGACAC AAAAAAATGG AA - #AGAAAAGA     2040     - ATGTACATTA CCTGGTTTCC TGGGATTCAA ACCTGCATAT TGTGATTTTA AT - #TTGACCAG     2100     - AAAATATGAT ATATATGTAT AATTTCACAG ATAATTTACT TATTTAAAAA TG - #CATGATAA     2160     - TGAGTTTTAC ATTACAAATT TCTGTTTTTT TAAAGTTATC TTACGCTATT TC - #TGTTGGTT     2220     - AGTAGACACT AATTCTGTCA GTAGGGGCAT GGTATAAGGA AATATCATAA TG - #TAATGAGG     2280     - TGGTACTATG ATTAAAAGCC ACTGTTACAT TTCAAAAAAA AAAAAAAAAA AC - #CATCTAAA     2340     #       2373       AAGA ATTATATTAT CAA     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 651 amino               (B) 
TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     - Gly Gln Asp Phe Gln Asn Phe Cys His Tyr Gl - #n Gly Tyr Ile Glu Gly     #                15     - Tyr Pro Lys Ser Val Val Met Val Ser Thr Cy - #s Thr Gly Leu Arg Gly     #            30     - Val Leu Gln Phe Glu Asn Val Ser Tyr Gly Il - #e Glu Pro Leu Glu Ser     #        45     - Ser Val Gly Phe Glu His Val Ile Tyr Gln Va - #l Lys His Lys Lys Ala     #    60     - Asp Val Ser Leu Tyr Asn Glu Lys Asp Ile Gl - #u Ser Arg Asp Leu Ser     #80     - Phe Lys Leu Gln Ser Ala Glu Pro Gln Gln As - #p Phe Ala Lys Tyr Ile     #                95     - Glu Met His Val Ile Val Glu Lys Gln Leu Ty - #r Asn His Met Gly Ser     #           110     - Asp Thr Thr Val Val Ala Gln Lys Val Phe Gl - #n Leu Ile Gly Leu Thr     #       125     - Asn Ala Ile Phe Val Ser Phe Asn Ile Thr Il - #e Ile Leu Ser Ser Leu     #   140     - Glu Leu Trp Ile Asp Glu Asn Lys Ile Ala Th - #r Thr Gly Glu Ala Asn     145                 1 - #50                 1 - #55                 1 -     #60     - Glu Leu Leu His Thr Phe Leu Arg Trp Lys Th - #r Ser Tyr Leu Val Leu     #               175     - Arg Pro His Asp Val Ala Phe Leu Leu Val Ty - #r Arg Glu Lys Ser Asn     #           190     - Tyr Val Gly Ala Thr Phe Gln Gly Lys Met Cy - #s Asp Ala Asn Tyr Ala     #       205     - Gly Gly Val Val Leu His Pro Arg Thr Ile Se - #r Leu Glu Ser Leu Ala     #   220     - Val Ile Leu Ala Gln Leu Leu Ser Leu Ser Me - #t Gly Ile Thr Tyr Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Asp Ile Asn Lys Cys Gln Cys Ser Gly Ala Va - #l Cys Ile Met Asn Pro     #               255     - Glu Ala Ile His Phe Ser Gly Val Lys Ile Ph - #e Ser Asn Cys Ser Phe     #           270     - Glu Asp Phe Ala His Phe Ile Ser Lys Gln Ly - #s Ser Gln Cys Leu His     #       285     - Asn Gln Pro Arg Leu Asp Pro 
Phe Phe Lys Gl - #n Gln Ala Val Cys Gly     #   300     - Asn Ala Lys Leu Glu Ala Gly Glu Glu Cys As - #p Cys Gly Thr Glu Gln     305                 3 - #10                 3 - #15                 3 -     #20     - Asp Cys Ala Leu Ile Gly Glu Thr Cys Cys As - #p Ile Ala Thr Cys Arg     #               335     - Phe Lys Ala Gly Ser Asn Cys Ala Glu Gly Pr - #o Cys Cys Glu Asn Cys     #           350     - Leu Phe Met Ser Lys Glu Arg Met Cys Arg Pr - #o Ser Phe Glu Glu Cys     #       365     - Asp Leu Pro Glu Tyr Cys Asn Gly Ser Ser Al - #a Ser Cys Pro Glu Asn     #   380     - His Tyr Val Gln Thr Gly His Pro Cys Gly Le - #u Asn Gln Trp Ile Cys     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Asp Gly Val Cys Met Ser Gly Asp Lys Gl - #n Cys Thr Asp Thr Phe     #               415     - Gly Lys Glu Val Glu Phe Gly Pro Ser Glu Cy - #s Tyr Ser His Leu Asn     #           430     - Ser Lys Thr Asp Val Ser Gly Asn Cys Gly Il - #e Ser Asp Ser Gly Tyr     #       445     - Thr Gln Cys Glu Ala Asp Asn Leu Gln Cys Gl - #y Lys Leu Ile Cys Lys     #   460     - Tyr Val Gly Lys Phe Leu Leu Gln Ile Pro Ar - #g Ala Thr Ile Ile Tyr     465                 4 - #70                 4 - #75                 4 -     #80     - Ala Asn Ile Ser Gly His Leu Cys Ile Ala Va - #l Glu Phe Ala Ser Asp     #               495     - His Ala Asp Ser Gln Lys Met Trp Ile Lys As - #p Gly Thr Ser Cys Gly     #           510     - Ser Asn Lys Val Cys Arg Asn Gln Arg Cys Va - #l Ser Ser Ser Tyr Leu     #       525     - Gly Tyr Asp Cys Thr Thr Asp Lys Cys Asn As - #p Arg Gly Val Cys Asn     #   540     - Asn Lys Lys His Cys His Cys Ser Ala Ser Ty - #r Leu Pro Pro Asp Cys     545                 5 - #50                 5 - #55                 5 -     #60     - Ser Val Gln Ser Asp Leu Trp Pro Gly Gly Se - #r Ile Asp Ser Gly Asn     #               575     - Phe Pro Pro Val Ala Ile Pro Ala Arg Leu Pr - #o Glu Arg Arg Tyr Ile     #           590     - Glu Asn Ile 
Tyr His Ser Lys Pro Met Arg Tr - #p Pro Phe Phe Leu Phe     #       605     - Ile Pro Phe Phe Ile Ile Phe Cys Val Leu Il - #e Ala Ile Met Val Lys     #   620     - Val Asn Phe Gln Arg Lys Lys Trp Arg Thr Gl - #u Asp Tyr Ser Ser Asp     625                 6 - #30                 6 - #35                 6 -     #40     - Glu Gln Pro Glu Ser Glu Ser Glu Pro Lys Gl - #y     #               650     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1768 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     - GGCACGAGCG ATTATGTTGG CGCTACCTAT CAAGGGAAGA TGTGTGACAA GA - #ACTATGCA       60     - GGAGGAGTTG CTTTGCACCC CAAAGCCGTA ACTCTGGAAT CACTTGCAAT TA - #TTTTAGTT      120     - CAGCTGCTGA GCCTCAGCAT GGGGCTAGCG TATGACGACG TGAACAAGTG CC - #AGTGTGGC      180     - GTACCTGTCT GCGTGATGAA CCCGGAAGCG CCTCACTCCA GCGGTGTCCG GG - #CCTTCAGT      240     - AACTGCAGCA TGGAGGACTT TTCCAAGTTT ATCACAAGTC AAAGCTCCCA CT - #GTCTGCAG      300     - AACCAGCCAA CGCTACAGCC ATCTTACAAG ATGGCGGTCT GTGGGAATGG AG - #AGGTGGAA      360     - GAAGATGAAA TTTGCGACTG TGGAAAGAAG GGCTGTGCAG AAATGCCCCC GC - #CATGCTGT      420     - AACCCCGACA CCTGTAAGCT GTCAGATGGC TCCGAGTGCT CCAGCGGGAT AT - #GCTGCAAC      480     - TCGTGCAAGC TGAAGCGGAA AGGGGAGGTT TGCAGGCTTG CCCAAGATGA GT - #GTGATGTC      540     - ACAGAGTACT GCAACGGCAC ATCCGAAGTG TGTGAAGACT TCTTTGTTCA AA - #ACGGTCAC      600     - CCATGTGACA ATCGCAAGTG GATCTGTATT AACGGCACCT GTCAGAGTGG AG - #AACAGCAG      660     - TGCCAGGATC TATTTGGCAT CGATGCAGGC TTTGGTTCAA GTGAATGTTT CT - #GGGAGCTG      720     - AATTCCAAGA GCGACATATC TGGGAGCTGT GGAATCTCTG CTGGGGGATA CA - #AGGAATGC      780     - CCACCTAATG ACCGGATGTG TGGGAAAATA ATATGTAAAT ACCAAAGTGA AA - #ATATACTA      840     - AAATTGAGGT CTGCCACTGT TATTTATGCC AATATAAGCG GGCATGTCTG CG - #TTTCCCTG      900     - GAATATCCCC 
AAGGTCATAA TGAGAGCCAG AAGATGTGGG TGAGAGATGG AA - #CCGTCTGC      960     - GGGTCAAATA AGGTTTGCCA GAATCAAAAA TGTGTAGCAG ACACTTTCTT GG - #GCTATGAT     1020     - TGCAACCTGG AAAAATGCAA CCACCATGGT GTATGTAATA ACAAGAAGAA CT - #GCCACTGT     1080     - GACCCCACAT ACTTACCTCC AGATTGTAAA AGAATGAAAG ATTCATATCC TG - #GCGGGAGC     1140     - ATTGATAGTG GCAACAAGGA AAGGGCTGAA CCCATCCCTG TACGGCCCTA CA - #TTGCAAGT     1200     - CGTTACCGCT CCAAGTCTCC ACGGTGGCCA TTTTTCTTGA TCATCCCTTT CT - #ACGTTGTG     1260     - ATCCTTGTCC TGATTGGGAT GCTGGTAAAA GTCTATTCCC AAAGGATGAA AT - #GGAGAATG     1320     - GATGACTTCT CAAGCGAAGA GCAATTTGAA AGTGAAAGTG AATCCAAAGA CT - #AGTCTGGA     1380     - CAGATTCCAC AATGTCACAA GTAATTCTCT TCAGTGGACA GAAAAAAAAG TG - #GAAAAGAA     1440     - AAGCCTATGC ATTATCTTGC CTGAAAGTCA AGCCTGCATA TCGTGGTCTC CA - #TCAGGCCA     1500     - GAAATCATAT CTCTCCATTA CACATGTATG ATACATATGT GTGTATATTA TT - #CCATAAAT     1560     - GATTTACTTG TAAGAAATGA ATGATTATGA ATTTCATATT ATACTTTGAT AT - #TTTACCCT     1620     - ATTTCTGGTA GTCGGTAGTC ATCAATTGTA TTTTCTAGTA GGTACATTAT AG - #AAAAGGCT     1680     - ATAAGAAAAT AAATGTGGTA CCATAATAAT CAATATCATA CAACCACCAT CT - #AAAAAAGG     1740     #           1768   TTAT ATTATCAA     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 457 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     - Gly Thr Ser Asp Tyr Val Gly Ala Thr Tyr Gl - #n Gly Lys Met Cys Asp     #                15     - Lys Asn Tyr Ala Gly Gly Val Ala Leu His Pr - #o Lys Ala Val Thr Leu     #            30     - Glu Ser Leu Ala Ile Ile Leu Val Gln Leu Le - #u Ser Leu Ser Met Gly     #        45     - Leu Ala Tyr Asp Asp Val Asn Lys Cys Gln Cy - #s Gly Val Pro Val Cys     #    60     - Val Met Asn Pro Glu Ala Pro His Ser Ser Gl - #y Val Arg Ala Phe Ser     #80     - Asn Cys Ser Met Glu 
Asp Phe Ser Lys Phe Il - #e Thr Ser Gln Ser Ser     #                95     - His Cys Leu Gln Asn Gln Pro Thr Leu Gln Pr - #o Ser Tyr Lys Met Ala     #           110     - Val Cys Gly Asn Gly Glu Val Glu Glu Asp Gl - #u Ile Cys Asp Cys Gly     #       125     - Lys Lys Gly Cys Ala Glu Met Pro Pro Pro Cy - #s Cys Asn Pro Asp Thr     #   140     - Cys Lys Leu Ser Asp Gly Ser Glu Cys Ser Se - #r Gly Ile Cys Cys Asn     145                 1 - #50                 1 - #55                 1 -     #60     - Ser Cys Lys Leu Lys Arg Lys Gly Glu Val Cy - #s Arg Leu Ala Gln Asp     #               175     - Glu Cys Asp Val Thr Glu Tyr Cys Asn Gly Th - #r Ser Glu Val Cys Glu     #           190     - Asp Phe Phe Val Gln Asn Gly His Pro Cys As - #p Asn Arg Lys Trp Ile     #       205     - Cys Ile Asn Gly Thr Cys Gln Ser Gly Glu Gl - #n Gln Cys Gln Asp Leu     #   220     - Phe Gly Ile Asp Ala Gly Phe Gly Ser Ser Gl - #u Cys Phe Trp Glu Leu     225                 2 - #30                 2 - #35                 2 -     #40     - Asn Ser Lys Ser Asp Ile Ser Gly Ser Cys Gl - #y Ile Ser Ala Gly Gly     #               255     - Tyr Lys Glu Cys Pro Pro Asn Asp Arg Met Cy - #s Gly Lys Ile Ile Cys     #           270     - Lys Tyr Gln Ser Glu Asn Ile Leu Lys Leu Ar - #g Ser Ala Thr Val Ile     #       285     - Tyr Ala Asn Ile Ser Gly His Val Cys Val Se - #r Leu Glu Tyr Pro Gln     #   300     - Gly His Asn Glu Ser Gln Lys Met Trp Val Ar - #g Asp Gly Thr Val Cys     305                 3 - #10                 3 - #15                 3 -     #20     - Gly Ser Asn Lys Val Cys Gln Asn Gln Lys Cy - #s Val Ala Asp Thr Phe     #               335     - Leu Gly Tyr Asp Cys Asn Leu Glu Lys Cys As - #n His His Gly Val Cys     #           350     - Asn Asn Lys Lys Asn Cys His Cys Asp Pro Th - #r Tyr Leu Pro Pro Asp     #       365     - Cys Lys Arg Met Lys Asp Ser Tyr Pro Gly Gl - #y Ser Ile Asp Ser Gly     #   380     - Asn Lys Glu Arg Ala Glu Pro Ile Pro Val Ar - #g Pro Tyr Ile Ala Ser     
385                 3 - #90                 3 - #95                 4 -     #00     - Arg Tyr Arg Ser Lys Ser Pro Arg Trp Pro Ph - #e Phe Leu Ile Ile Pro     #               415     - Phe Tyr Val Val Ile Leu Val Leu Ile Gly Me - #t Leu Val Lys Val Tyr     #           430     - Ser Gln Arg Met Lys Trp Arg Met Asp Asp Ph - #e Ser Ser Glu Glu Gln     #       445     - Phe Glu Ser Glu Ser Glu Ser Lys Asp     #   455     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2553 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 17..2221     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     #CTG AGT GGG CTG         49 CTC ATC TTG CTT CTA     #Met Arg Leu Ile Leu Leu Leu Leu Ser Gly L - #eu     #                 10     - AGT GAA CTT GGC GGC CTT AGC CAG TCC CAA AC - #A GAA GGC ACT CGT GAG       97     Ser Glu Leu Gly Gly Leu Ser Gln Ser Gln Th - #r Glu Gly Thr Arg Glu     #             25     - AAA TTA CAC GTG CAA GTC ACA GTG CCA GAG AA - #A ATC CGG TCC GTC ACA      145     Lys Leu His Val Gln Val Thr Val Pro Glu Ly - #s Ile Arg Ser Val Thr     #         40     - AGC AAT GGC TAC GAA ACA CAG GTG ACC TAC AA - #T CTC AAA ATC GAA GGG      193     Ser Asn Gly Tyr Glu Thr Gln Val Thr Tyr As - #n Leu Lys Ile Glu Gly     #     55     - AAA ACA TAC ACC TTG GAC CTA ATG CAA AAA CC - #G TTC TTG CCT CCC AAC      241     Lys Thr Tyr Thr Leu Asp Leu Met Gln Lys Pr - #o Phe Leu Pro Pro Asn     # 75     - TTT AGA GTA TAC AGT TAT GAC AAC GCA GGA AT - #C ATG AGG TCT CTT GAG      289     Phe Arg Val Tyr Ser Tyr Asp Asn Ala Gly Il - #e Met Arg Ser Leu Glu     #                 90     - CAG AAG TTT CAG AAT ATC TGC TAC TTC CAA GG - #A TAC ATT GAA GGT TAT      337     Gln Lys Phe Gln Asn Ile Cys Tyr Phe Gln Gl - #y Tyr Ile Glu Gly Tyr     #            105     - CCA 
AAT TCT ATG GTG ATT GTT AGC ACA TGT AC - #T GGA CTG AGG GGT TTT      385     Pro Asn Ser Met Val Ile Val Ser Thr Cys Th - #r Gly Leu Arg Gly Phe     #       120     - CTC CAA TTT GGA AAC GTT AGC TAT GGA ATT GA - #A CCT CTG GAA TCT TCC      433     Leu Gln Phe Gly Asn Val Ser Tyr Gly Ile Gl - #u Pro Leu Glu Ser Ser     #   135     - AGT GGT TTT GAA CAC GTG ATC TAC CAA GTG GA - #A CCT GAG AAA GGA GGT      481     Ser Gly Phe Glu His Val Ile Tyr Gln Val Gl - #u Pro Glu Lys Gly Gly     140                 1 - #45                 1 - #50                 1 -     #55     - GCA TTA CTC TAC GCC GAG AAG GAT ATC GAT TT - #A AGA GAC TCG CAG TAT      529     Ala Leu Leu Tyr Ala Glu Lys Asp Ile Asp Le - #u Arg Asp Ser Gln Tyr     #               170     - AAG ATA CGA AGT ATC AAG CCA CAG CGG ATC GT - #C TCT CAC TAT TTG GAA      577     Lys Ile Arg Ser Ile Lys Pro Gln Arg Ile Va - #l Ser His Tyr Leu Glu     #           185     - ATA CAT ATT GTC GTT GAA AAG CAA ATG TTT GA - #G CAT ATC GGG GCT GAT      625     Ile His Ile Val Val Glu Lys Gln Met Phe Gl - #u His Ile Gly Ala Asp     #       200     - ACA GCC ATT GTC ACT CAA AAG ATT TTC CAG TT - #G ATT GGA CTG GCA AAT      673     Thr Ala Ile Val Thr Gln Lys Ile Phe Gln Le - #u Ile Gly Leu Ala Asn     #   215     - GCT ATC TTT GCC CCC TTT AAT CTT ACA GTA AT - #T CTG TCT TCC CTG GAA      721     Ala Ile Phe Ala Pro Phe Asn Leu Thr Val Il - #e Leu Ser Ser Leu Glu     220                 2 - #25                 2 - #30                 2 -     #35     - TTT TGG ATG GAT GAA AAC AAA ATC TTG ACC AC - #A GGC GAT GCT AAC AAG      769     Phe Trp Met Asp Glu Asn Lys Ile Leu Thr Th - #r Gly Asp Ala Asn Lys     #               250     - TTG CTC TAC AGG TTC CTG AAG TGG AAA CAG TC - #G TAC CTT GTT CTG CGA      817     Leu Leu Tyr Arg Phe Leu Lys Trp Lys Gln Se - #r Tyr Leu Val Leu Arg     #           265     - CCA CAT GAT ATG GCG TTT TTA CTC GTC TAC AG - #G AAC ACT ACC GAT TAT      865     Pro His Asp Met Ala Phe Leu Leu Val Tyr Ar - #g Asn Thr Thr 
Asp Tyr     #       280     - GTT GGC GCT ACC TAT CAA GGG AAG ATG TGT GA - #C AAG AAC TAT GCA GGA      913     Val Gly Ala Thr Tyr Gln Gly Lys Met Cys As - #p Lys Asn Tyr Ala Gly     #   295     - GGA GTT GCT TTG CAC CCC AAA GCC GTA ACT CT - #G GAA TCA CTT GCA ATT      961     Gly Val Ala Leu His Pro Lys Ala Val Thr Le - #u Glu Ser Leu Ala Ile     300                 3 - #05                 3 - #10                 3 -     #15     - ATT TTA GTT CAG CTG CTG AGC CTC AGC ATG GG - #G CTA GCG TAT GAC GAC     1009     Ile Leu Val Gln Leu Leu Ser Leu Ser Met Gl - #y Leu Ala Tyr Asp Asp     #               330     - GTG AAC AAG TGC CAG TGT GGC GTA CCT GTC TG - #C GTG ATG AAC CCG GAA     1057     Val Asn Lys Cys Gln Cys Gly Val Pro Val Cy - #s Val Met Asn Pro Glu     #           345     - GCG CCT CAC TCC AGC GGT GTC CGG GCC TTC AG - #T AAC TGC AGC ATG GAG     1105     Ala Pro His Ser Ser Gly Val Arg Ala Phe Se - #r Asn Cys Ser Met Glu     #       360     - GAC TTT TCC AAG TTT ATC ACA AGT CAA AGC TC - #C CAC TGT CTG CAG AAC     1153     Asp Phe Ser Lys Phe Ile Thr Ser Gln Ser Se - #r His Cys Leu Gln Asn     #   375     - CAG CCA ACG CTA CAG CCA TCT TAC AAG ATG GC - #G GTC TGT GGG AAT GGA     1201     Gln Pro Thr Leu Gln Pro Ser Tyr Lys Met Al - #a Val Cys Gly Asn Gly     380                 3 - #85                 3 - #90                 3 -     #95     - GAG GTG GAA GAA GAT GAA ATT TGC GAC TGT GG - #A AAG AAG GGC TGT GCA     1249     Glu Val Glu Glu Asp Glu Ile Cys Asp Cys Gl - #y Lys Lys Gly Cys Ala     #               410     - GAA ATG CCC CCG CCA TGC TGT AAC CCC GAC AC - #C TGT AAG CTG TCA GAT     1297     Glu Met Pro Pro Pro Cys Cys Asn Pro Asp Th - #r Cys Lys Leu Ser Asp     #           425     - GGC TCC GAG TGC TCC AGC GGG ATA TGC TGC AA - #C TCG TGC AAG CTG AAG     1345     Gly Ser Glu Cys Ser Ser Gly Ile Cys Cys As - #n Ser Cys Lys Leu Lys     #       440     - CGG AAA GGG GAG GTT TGC AGG CTT GCC CAA GA - #T GAG TGT GAT GTC ACA     1393     Arg Lys Gly Glu Val Cys 
Arg Leu Ala Gln As - #p Glu Cys Asp Val Thr     #   455     - GAG TAC TGC AAC GGC ACA TCC GAA GTG TGT GA - #A GAC TTC TTT GTT CAA     1441     Glu Tyr Cys Asn Gly Thr Ser Glu Val Cys Gl - #u Asp Phe Phe Val Gln     460                 4 - #65                 4 - #70                 4 -     #75     - AAC GGT CAC CCA TGT GAC AAT CGC AAG TGG AT - #C TGT ATT AAC GGC ACC     1489     Asn Gly His Pro Cys Asp Asn Arg Lys Trp Il - #e Cys Ile Asn Gly Thr     #               490     - TGT CAG AGT GGA GAA CAG CAG TGC CAG GAT CT - #A TTT GGC ATC GAT GCA     1537     Cys Gln Ser Gly Glu Gln Gln Cys Gln Asp Le - #u Phe Gly Ile Asp Ala     #           505     - GGC TTT GGT TCA AGT GAA TGT TTC TGG GAG CT - #G AAT TCC AAG AGC GAC     1585     Gly Phe Gly Ser Ser Glu Cys Phe Trp Glu Le - #u Asn Ser Lys Ser Asp     #       520     - ATA TCT GGG AGC TGT GGA ATC TCT GCT GGG GG - #A TAC AAG GAA TGC CCA     1633     Ile Ser Gly Ser Cys Gly Ile Ser Ala Gly Gl - #y Tyr Lys Glu Cys Pro     #   535     - CCT AAT GAC CGG ATG TGT GGG AAA ATA ATA TG - #T AAA TAC CAA AGT GAA     1681     Pro Asn Asp Arg Met Cys Gly Lys Ile Ile Cy - #s Lys Tyr Gln Ser Glu     540                 5 - #45                 5 - #50                 5 -     #55     - AAT ATA CTA AAA TTG AGG TCT GCC ACT GTT AT - #T TAT GCC AAT ATA AGC     1729     Asn Ile Leu Lys Leu Arg Ser Ala Thr Val Il - #e Tyr Ala Asn Ile Ser     #               570     - GGG CAT GTC TGC GTT TCC CTG GAA TAT CCC CA - #A GGT CAT AAT GAG AGC     1777     Gly His Val Cys Val Ser Leu Glu Tyr Pro Gl - #n Gly His Asn Glu Ser     #           585     - CAG AAG ATG TGG GTG AGA GAT GGA ACC GTC TG - #C GGG TCA AAT AAG GTT     1825     Gln Lys Met Trp Val Arg Asp Gly Thr Val Cy - #s Gly Ser Asn Lys Val     #       600     - TGC CAG AAT CAA AAA TGT GTA GCA GAC ACT TT - #C TTG GGC TAT GAT TGC     1873     Cys Gln Asn Gln Lys Cys Val Ala Asp Thr Ph - #e Leu Gly Tyr Asp Cys     #   615     - AAC CTG GAA AAA TGC AAC CAC CAT GGT GTA TG - #T AAT AAC AAG AAG AAC     
1921     Asn Leu Glu Lys Cys Asn His His Gly Val Cy - #s Asn Asn Lys Lys Asn     620                 6 - #25                 6 - #30                 6 -     #35     - TGC CAC TGT GAC CCC ACA TAC TTA CCT CCA GA - #T TGT AAA AGA ATG AAA     1969     Cys His Cys Asp Pro Thr Tyr Leu Pro Pro As - #p Cys Lys Arg Met Lys     #               650     - GAT TCA TAT CCT GGC GGG AGC ATT GAT AGT GG - #C AAC AAG GAA AGG GCT     2017     Asp Ser Tyr Pro Gly Gly Ser Ile Asp Ser Gl - #y Asn Lys Glu Arg Ala     #           665     - GAA CCC ATC CCT GTA CGG CCC TAC ATT GCA AG - #T CGT TAC CGC TCC AAG     2065     Glu Pro Ile Pro Val Arg Pro Tyr Ile Ala Se - #r Arg Tyr Arg Ser Lys     #       680     - TCT CCA CGG TGG CCA TTT TTC TTG ATC ATC CC - #T TTC TAC GTT GTG ATC     2113     Ser Pro Arg Trp Pro Phe Phe Leu Ile Ile Pr - #o Phe Tyr Val Val Ile     #   695     - CTT GTC CTG ATT GGG ATG CTG GTA AAA GTC TA - #T TCC CAA AGG ATG AAA     2161     Leu Val Leu Ile Gly Met Leu Val Lys Val Ty - #r Ser Gln Arg Met Lys     700                 7 - #05                 7 - #10                 7 -     #15     - TGG AGA ATG GAT GAC TTC TCA AGC GAA GAG CA - #A TTT GAA AGT GAA AGT     2209     Trp Arg Met Asp Asp Phe Ser Ser Glu Glu Gl - #n Phe Glu Ser Glu Ser     #               730     - GAA TCC AAA GAC TAGTCTGGAC AGATTCCACA ATGTCACAAG TA - #ATTCTCTT     2261     Glu Ser Lys Asp                 735     - CAGTGGACAG AAAAAAAAGT GGAAAAGAAA AGCCTATGCA TTATCTTGCC TG - #AAAGTCAA     2321     - GCCTGCATAT CGTGGTCTCC ATCAGGCCAG AAATCATATC TCTCCATTAC AC - #ATGTATGA     2381     - TACATATGTG TGTATATTAT TCCATAAATG ATTTACTTGT AAGAAATGAA TG - #ATTATGAA     2441     - TTTCATATTA TACTTTGATA TTTTACCCTA TTTCTGGTAG TCGGTAGTCA TC - #AATTGTAT     2501     - TTTCTAGTAG GTACATTATA GAAAAGGCTA TAAGAAAATA AATGTGGTAC CA - #     2553     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 735 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -  
   (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     - Met Arg Leu Ile Leu Leu Leu Leu Ser Gly Le - #u Ser Glu Leu Gly Gly     #                 15     - Leu Ser Gln Ser Gln Thr Glu Gly Thr Arg Gl - #u Lys Leu His Val Gln     #             30     - Val Thr Val Pro Glu Lys Ile Arg Ser Val Th - #r Ser Asn Gly Tyr Glu     #         45     - Thr Gln Val Thr Tyr Asn Leu Lys Ile Glu Gl - #y Lys Thr Tyr Thr Leu     #     60     - Asp Leu Met Gln Lys Pro Phe Leu Pro Pro As - #n Phe Arg Val Tyr Ser     # 80     - Tyr Asp Asn Ala Gly Ile Met Arg Ser Leu Gl - #u Gln Lys Phe Gln Asn     #                 95     - Ile Cys Tyr Phe Gln Gly Tyr Ile Glu Gly Ty - #r Pro Asn Ser Met Val     #           110     - Ile Val Ser Thr Cys Thr Gly Leu Arg Gly Ph - #e Leu Gln Phe Gly Asn     #       125     - Val Ser Tyr Gly Ile Glu Pro Leu Glu Ser Se - #r Ser Gly Phe Glu His     #   140     - Val Ile Tyr Gln Val Glu Pro Glu Lys Gly Gl - #y Ala Leu Leu Tyr Ala     145                 1 - #50                 1 - #55                 1 -     #60     - Glu Lys Asp Ile Asp Leu Arg Asp Ser Gln Ty - #r Lys Ile Arg Ser Ile     #               175     - Lys Pro Gln Arg Ile Val Ser His Tyr Leu Gl - #u Ile His Ile Val Val     #           190     - Glu Lys Gln Met Phe Glu His Ile Gly Ala As - #p Thr Ala Ile Val Thr     #       205     - Gln Lys Ile Phe Gln Leu Ile Gly Leu Ala As - #n Ala Ile Phe Ala Pro     #   220     - Phe Asn Leu Thr Val Ile Leu Ser Ser Leu Gl - #u Phe Trp Met Asp Glu     225                 2 - #30                 2 - #35                 2 -     #40     - Asn Lys Ile Leu Thr Thr Gly Asp Ala Asn Ly - #s Leu Leu Tyr Arg Phe     #               255     - Leu Lys Trp Lys Gln Ser Tyr Leu Val Leu Ar - #g Pro His Asp Met Ala     #           270     - Phe Leu Leu Val Tyr Arg Asn Thr Thr Asp Ty - #r Val Gly Ala Thr Tyr     #       285     - Gln Gly Lys Met Cys Asp Lys Asn Tyr Ala Gl - #y Gly Val Ala Leu His     #   300     - Pro Lys Ala Val Thr Leu Glu Ser Leu 
Ala Il - #e Ile Leu Val Gln Leu     305                 3 - #10                 3 - #15                 3 -     #20     - Leu Ser Leu Ser Met Gly Leu Ala Tyr Asp As - #p Val Asn Lys Cys Gln     #               335     - Cys Gly Val Pro Val Cys Val Met Asn Pro Gl - #u Ala Pro His Ser Ser     #           350     - Gly Val Arg Ala Phe Ser Asn Cys Ser Met Gl - #u Asp Phe Ser Lys Phe     #       365     - Ile Thr Ser Gln Ser Ser His Cys Leu Gln As - #n Gln Pro Thr Leu Gln     #   380     - Pro Ser Tyr Lys Met Ala Val Cys Gly Asn Gl - #y Glu Val Glu Glu Asp     385                 3 - #90                 3 - #95                 4 -     #00     - Glu Ile Cys Asp Cys Gly Lys Lys Gly Cys Al - #a Glu Met Pro Pro Pro     #               415     - Cys Cys Asn Pro Asp Thr Cys Lys Leu Ser As - #p Gly Ser Glu Cys Ser     #           430     - Ser Gly Ile Cys Cys Asn Ser Cys Lys Leu Ly - #s Arg Lys Gly Glu Val     #       445     - Cys Arg Leu Ala Gln Asp Glu Cys Asp Val Th - #r Glu Tyr Cys Asn Gly     #   460     - Thr Ser Glu Val Cys Glu Asp Phe Phe Val Gl - #n Asn Gly His Pro Cys     465                 4 - #70                 4 - #75                 4 -     #80     - Asp Asn Arg Lys Trp Ile Cys Ile Asn Gly Th - #r Cys Gln Ser Gly Glu     #               495     - Gln Gln Cys Gln Asp Leu Phe Gly Ile Asp Al - #a Gly Phe Gly Ser Ser     #           510     - Glu Cys Phe Trp Glu Leu Asn Ser Lys Ser As - #p Ile Ser Gly Ser Cys     #       525     - Gly Ile Ser Ala Gly Gly Tyr Lys Glu Cys Pr - #o Pro Asn Asp Arg Met     #   540     - Cys Gly Lys Ile Ile Cys Lys Tyr Gln Ser Gl - #u Asn Ile Leu Lys Leu     545                 5 - #50                 5 - #55                 5 -     #60     - Arg Ser Ala Thr Val Ile Tyr Ala Asn Ile Se - #r Gly His Val Cys Val     #               575     - Ser Leu Glu Tyr Pro Gln Gly His Asn Glu Se - #r Gln Lys Met Trp Val     #           590     - Arg Asp Gly Thr Val Cys Gly Ser Asn Lys Va - #l Cys Gln Asn Gln Lys     #       605     - Cys Val Ala Asp 
Thr Phe Leu Gly Tyr Asp Cy - #s Asn Leu Glu Lys Cys     #   620     - Asn His His Gly Val Cys Asn Asn Lys Lys As - #n Cys His Cys Asp Pro     625                 6 - #30                 6 - #35                 6 -     #40     - Thr Tyr Leu Pro Pro Asp Cys Lys Arg Met Ly - #s Asp Ser Tyr Pro Gly     #               655     - Gly Ser Ile Asp Ser Gly Asn Lys Glu Arg Al - #a Glu Pro Ile Pro Val     #           670     - Arg Pro Tyr Ile Ala Ser Arg Tyr Arg Ser Ly - #s Ser Pro Arg Trp Pro     #       685     - Phe Phe Leu Ile Ile Pro Phe Tyr Val Val Il - #e Leu Val Leu Ile Gly     #   700     - Met Leu Val Lys Val Tyr Ser Gln Arg Met Ly - #s Trp Arg Met Asp Asp     705                 7 - #10                 7 - #15                 7 -     #20     - Phe Ser Ser Glu Glu Gln Phe Glu Ser Glu Se - #r Glu Ser Lys Asp     #               735     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2650 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 72..2273     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     - CATCTCGCAC TTCCAACTGC CCTGTAACCA CCAACTGCCC TTATTCCGGC TG - #GGACCCAG       60     #GGG CTC GGC GGG CTG     110TG TTT CTG CTC AGC     #Leu Phe Leu Leu Ser Gly Leu Gly Gly Leu     #           745     - CGG ATG GAC AGT AAT TTT GAT AGT TTA CCT GT - #G CAA ATT ACA GTT CCG      158     Arg Met Asp Ser Asn Phe Asp Ser Leu Pro Va - #l Gln Ile Thr Val Pro     #   760     - GAG AAA ATA CGG TCA ATA ATA AAG GAA GGA AT - #T GAA TCG CAG GCA TCC      206     Glu Lys Ile Arg Ser Ile Ile Lys Glu Gly Il - #e Glu Ser Gln Ala Ser     765                 7 - #70                 7 - #75                 7 -     #80     - TAC AAA ATT GTA ATT GAA GGG AAA CCA TAT AC - #T GTG AAT TTA ATG CAA      254     Tyr Lys Ile Val Ile Glu Gly Lys Pro Tyr Th - #r Val Asn 
Leu Met Gln     #               795     - AAA AAC TTT TTA CCC CAT AAT TTT AGA GTT TA - #C AGT TAT AGT GGC ACA      302     Lys Asn Phe Leu Pro His Asn Phe Arg Val Ty - #r Ser Tyr Ser Gly Thr     #           810     - GGA ATT ATG AAA CCA CTT GAC CAA GAT TTT CA - #G AAT TTC TGC CAC TAC      350     Gly Ile Met Lys Pro Leu Asp Gln Asp Phe Gl - #n Asn Phe Cys His Tyr     #       825     - CAA GGG TAT ATT GAA GGT TAT CCA AAA TCT GT - #G GTG ATG GTT AGC ACA      398     Gln Gly Tyr Ile Glu Gly Tyr Pro Lys Ser Va - #l Val Met Val Ser Thr     #   840     - TGT ACT GGA CTC AGG GGC GTA CTA CAG TTT GA - #A AAT GTT AGT TAT GGA      446     Cys Thr Gly Leu Arg Gly Val Leu Gln Phe Gl - #u Asn Val Ser Tyr Gly     845                 8 - #50                 8 - #55                 8 -     #60     - ATA GAA CCC CTG GAG TCT TCA GTT GGC TTT GA - #A CAT GTA ATT TAC CAA      494     Ile Glu Pro Leu Glu Ser Ser Val Gly Phe Gl - #u His Val Ile Tyr Gln     #               875     - GTA AAA CAT AAG AAA GCA GAT GTT TCC TTA TA - #T AAT GAG AAG GAT ATT      542     Val Lys His Lys Lys Ala Asp Val Ser Leu Ty - #r Asn Glu Lys Asp Ile     #           890     - GAA TCA AGA GAT CTG TCC TTT AAA TTA CAA AG - #C GCA GAG CCA CAG CAA      590     Glu Ser Arg Asp Leu Ser Phe Lys Leu Gln Se - #r Ala Glu Pro Gln Gln     #       905     - GAT TTT GCA AAG TAT ATA GAA ATG CAT GTT AT - #A GTT GAA AAA CAA TTG      638     Asp Phe Ala Lys Tyr Ile Glu Met His Val Il - #e Val Glu Lys Gln Leu     #   920     - TAT AAT CAT ATG GGG TCT GAT ACA ACT GTT GT - #C GCT CAA AAA GTT TTC      686     Tyr Asn His Met Gly Ser Asp Thr Thr Val Va - #l Ala Gln Lys Val Phe     925                 9 - #30                 9 - #35                 9 -     #40     - CAG TTG ATT GGA TTG ACG AAT GCT ATT TTT GT - #T TCA TTT AAT ATT ACA      734     Gln Leu Ile Gly Leu Thr Asn Ala Ile Phe Va - #l Ser Phe Asn Ile Thr     #               955     - ATT ATT CTG TCT TCA TTG GAG CTT TGG ATA GA - #T GAA AAT AAA ATT GCA      782     Ile Ile Leu 
Ser Ser Leu Glu Leu Trp Ile As - #p Glu Asn Lys Ile Ala     #           970     - ACC ACT GGA GAA GCT AAT GAG TTA TTA CAC AC - #A TTT TTA AGA TGG AAA      830     Thr Thr Gly Glu Ala Asn Glu Leu Leu His Th - #r Phe Leu Arg Trp Lys     #       985     - ACA TCT TAT CTT GTT TTA CGT CCT CAT GAT GT - #G GCA TTT TTA CTT GTT      878     Thr Ser Tyr Leu Val Leu Arg Pro His Asp Va - #l Ala Phe Leu Leu Val     #   1000     - TAC AGA GAA AAG TCA AAT TAT GTT GGT GCA AC - #C TTT CAA GGG AAG ATG      926     Tyr Arg Glu Lys Ser Asn Tyr Val Gly Ala Th - #r Phe Gln Gly Lys Met     #               10201010 - #                1015     - TGT GAT GCA AAC TAT GCA GGA GGT GTT GTT CT - #G CAC CCC AGA ACC ATA      974     Cys Asp Ala Asn Tyr Ala Gly Gly Val Val Le - #u His Pro Arg Thr Ile     #              10350     - AGT CTG GAA TCA CTT GCA GTT ATT TTA GCT CA - #A TTA TTG AGC CTT AGT     1022     Ser Leu Glu Ser Leu Ala Val Ile Leu Ala Gl - #n Leu Leu Ser Leu Ser     #          10505     - ATG GGG ATC ACT TAT GAT GAC ATT AAC AAA TG - #C CAG TGC TCA GGA GCT     1070     Met Gly Ile Thr Tyr Asp Asp Ile Asn Lys Cy - #s Gln Cys Ser Gly Ala     #      10650     - GTC TGC ATT ATG AAT CCA GAA GCA ATT CAT TT - #C AGT GGT GTG AAG ATC     1118     Val Cys Ile Met Asn Pro Glu Ala Ile His Ph - #e Ser Gly Val Lys Ile     #  10805     - TTT AGT AAC TGC AGC TTC GAA GAC TTT GCA CA - #T TTT ATT TCA AAG CAG     1166     Phe Ser Asn Cys Ser Phe Glu Asp Phe Ala Hi - #s Phe Ile Ser Lys Gln     #               11001090 - #                1095     - AAG TCC CAG TGT CTT CAC AAT CAG CCT CGC TT - #A GAT CCT TTT TTC AAA     1214     Lys Ser Gln Cys Leu His Asn Gln Pro Arg Le - #u Asp Pro Phe Phe Lys     #              11150     - CAG CAA GCA GTG TGT GGT AAT GCA AAG CTG GA - #A GCA GGA GAG GAG TGT     1262     Gln Gln Ala Val Cys Gly Asn Ala Lys Leu Gl - #u Ala Gly Glu Glu Cys     #          11305     - GAC TGT GGG ACT GAA CAG GAT TGT GCC CTT AT - #T GGA GAA ACA TGC TGT     1310     Asp Cys Gly Thr Glu Gln Asp 
Cys Ala Leu Il - #e Gly Glu Thr Cys Cys     #      11450     - GAT ATT GCC ACA TGT AGA TTT AAA GCC GGT TC - #A AAC TGT GCT GAA GGA     1358     Asp Ile Ala Thr Cys Arg Phe Lys Ala Gly Se - #r Asn Cys Ala Glu Gly     #  11605     - CCA TGC TGC GAA AAC TGT CTA TTT ATG TCA AA - #A GAA AGA ATG TGT AGG     1406     Pro Cys Cys Glu Asn Cys Leu Phe Met Ser Ly - #s Glu Arg Met Cys Arg     #               11801170 - #                1175     - CCT TCC TTT GAA GAA TGC GAC CTC CCT GAA TA - #T TGC AAT GGA TCA TCT     1454     Pro Ser Phe Glu Glu Cys Asp Leu Pro Glu Ty - #r Cys Asn Gly Ser Ser     #              11950     - GCA TCA TGC CCA GAA AAC CAC TAT GTT CAG AC - #T GGG CAT CCG TGT GGA     1502     Ala Ser Cys Pro Glu Asn His Tyr Val Gln Th - #r Gly His Pro Cys Gly     #          12105     - CTG AAT CAA TGG ATC TGT ATA GAT GGA GTT TG - #T ATG AGT GGG GAT AAA     1550     Leu Asn Gln Trp Ile Cys Ile Asp Gly Val Cy - #s Met Ser Gly Asp Lys     #      12250     - CAA TGT ACA GAC ACA TTT GGC AAA GAA GTA GA - #G TTT GGC CCT TCA GAA     1598     Gln Cys Thr Asp Thr Phe Gly Lys Glu Val Gl - #u Phe Gly Pro Ser Glu     #  12405     - TGT TAT TCT CAC CTT AAT TCA AAG ACT GAT GT - #A TCT GGA AAC TGT GGT     1646     Cys Tyr Ser His Leu Asn Ser Lys Thr Asp Va - #l Ser Gly Asn Cys Gly     #               12601250 - #                1255     - ATA AGT GAT TCA GGA TAC ACA CAG TGT GAA GC - #T GAC AAT CTG CAG TGC     1694     Ile Ser Asp Ser Gly Tyr Thr Gln Cys Glu Al - #a Asp Asn Leu Gln Cys     #              12750     - GGA AAA TTA ATA TGT AAA TAT GTA GGT AAA TT - #T TTA TTA CAA ATT CCA     1742     Gly Lys Leu Ile Cys Lys Tyr Val Gly Lys Ph - #e Leu Leu Gln Ile Pro     #          12905     - AGA GCC ACT ATT ATT TAT GCC AAC ATA AGT GG - #A CAT CTC TGC ATT GCT     1790     Arg Ala Thr Ile Ile Tyr Ala Asn Ile Ser Gl - #y His Leu Cys Ile Ala     #      13050     - GTG GAA TTT GCC AGT GAT CAT GCA GAC AGC CA - #A AAG ATG TGG ATA AAA     1838     Val Glu Phe Ala Ser Asp His Ala Asp Ser Gl - #n 
Lys Met Trp Ile Lys     #  13205     - GAT GGA ACT TCT TGT GGT TCA AAT AAG GTT TG - #C AGG AAT CAA AGA TGT     1886     Asp Gly Thr Ser Cys Gly Ser Asn Lys Val Cy - #s Arg Asn Gln Arg Cys     #               13401330 - #                1335     - GTG AGT TCT TCA TAC TTG GGT TAT GAT TGT AC - #T ACT GAC AAA TGC AAT     1934     Val Ser Ser Ser Tyr Leu Gly Tyr Asp Cys Th - #r Thr Asp Lys Cys Asn     #              13550     - GAT AGA GGT GTA TGC AAT AAC AAA AAG CAC TG - #T CAC TGT AGT GCT TCA     1982     Asp Arg Gly Val Cys Asn Asn Lys Lys His Cy - #s His Cys Ser Ala Ser     #          13705     - TAT TTA CCT CCA GAT TGC TCA GTT CAA TCA GA - #T CTA TGG CCT GGT GGG     2030     Tyr Leu Pro Pro Asp Cys Ser Val Gln Ser As - #p Leu Trp Pro Gly Gly     #      13850     - AGT ATT GAC AGT GGC AAT TTT CCA CCT GTA GC - #T ATA CCA GCC AGA CTC     2078     Ser Ile Asp Ser Gly Asn Phe Pro Pro Val Al - #a Ile Pro Ala Arg Leu     #  14005     - CCT GAA AGG CGC TAC ATT GAG AAC ATT TAC CA - #T TCC AAA CCA ATG AGA     2126     Pro Glu Arg Arg Tyr Ile Glu Asn Ile Tyr Hi - #s Ser Lys Pro Met Arg     #               14201410 - #                1415     - TGG CCA TTT TTC TTA TTC ATT CCT TTC TTT AT - #T ATT TTC TGT GTA CTG     2174     Trp Pro Phe Phe Leu Phe Ile Pro Phe Phe Il - #e Ile Phe Cys Val Leu     #              14350     - ATT GCT ATA ATG GTG AAA GTT AAT TTC CAA AG - #G AAA AAA TGG AGA ACT     2222     Ile Ala Ile Met Val Lys Val Asn Phe Gln Ar - #g Lys Lys Trp Arg Thr     #          14505     - GAG GAC TAT TCA AGC GAT GAG CAA CCT GAA AG - #T GAG AGT GAA CCT AAA     2270     Glu Asp Tyr Ser Ser Asp Glu Gln Pro Glu Se - #r Glu Ser Glu Pro Lys     #      14650     - GGG TAGTCTGGAC AACAGAGATG CCATGATATC ACTTCTTCTA GAGTAATTA - #T     2323     Gly     - CTGTGATGGA TGGACACAAA AAAATGGAAA GAAAAGAATG TACATTACCT GG - #TTTCCTGG     2383     - GATTCAAACC TGCATATTGT GATTTTAATT TGACCAGAAA ATATGATATA TA - #TGTATAAT     2443     - TTCACAGATA ATTTACTTAT TTAAAAATGC ATGATAATGA GTTTTACATT AC - 
#AAATTTCT     2503     - GTTTTTTTAA AGTTATCTTA CGCTATTTCT GTTGGTTAGT AGACACTAAT TC - #TGTCAGTA     2563     - GGGGCATGGT ATAAGGAAAT ATCATAATGT AATGAGGTGG TACTATGATT AA - #AAGCCACT     2623     #           2650   AAAA AAAAAAA     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 734 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     - Met Trp Val Leu Phe Leu Leu Ser Gly Leu Gl - #y Gly Leu Arg Met Asp     #                 15     - Ser Asn Phe Asp Ser Leu Pro Val Gln Ile Th - #r Val Pro Glu Lys Ile     #             30     - Arg Ser Ile Ile Lys Glu Gly Ile Glu Ser Gl - #n Ala Ser Tyr Lys Ile     #         45     - Val Ile Glu Gly Lys Pro Tyr Thr Val Asn Le - #u Met Gln Lys Asn Phe     #     60     - Leu Pro His Asn Phe Arg Val Tyr Ser Tyr Se - #r Gly Thr Gly Ile Met     # 80     - Lys Pro Leu Asp Gln Asp Phe Gln Asn Phe Cy - #s His Tyr Gln Gly Tyr     #                 95     - Ile Glu Gly Tyr Pro Lys Ser Val Val Met Va - #l Ser Thr Cys Thr Gly     #           110     - Leu Arg Gly Val Leu Gln Phe Glu Asn Val Se - #r Tyr Gly Ile Glu Pro     #       125     - Leu Glu Ser Ser Val Gly Phe Glu His Val Il - #e Tyr Gln Val Lys His     #   140     - Lys Lys Ala Asp Val Ser Leu Tyr Asn Glu Ly - #s Asp Ile Glu Ser Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Asp Leu Ser Phe Lys Leu Gln Ser Ala Glu Pr - #o Gln Gln Asp Phe Ala     #               175     - Lys Tyr Ile Glu Met His Val Ile Val Glu Ly - #s Gln Leu Tyr Asn His     #           190     - Met Gly Ser Asp Thr Thr Val Val Ala Gln Ly - #s Val Phe Gln Leu Ile     #       205     - Gly Leu Thr Asn Ala Ile Phe Val Ser Phe As - #n Ile Thr Ile Ile Leu     #   220     - Ser Ser Leu Glu Leu Trp Ile Asp Glu Asn Ly - #s Ile Ala Thr Thr Gly     225                 2 - #30                 2 - #35                 2 -  
   #40     - Glu Ala Asn Glu Leu Leu His Thr Phe Leu Ar - #g Trp Lys Thr Ser Tyr     #               255     - Leu Val Leu Arg Pro His Asp Val Ala Phe Le - #u Leu Val Tyr Arg Glu     #           270     - Lys Ser Asn Tyr Val Gly Ala Thr Phe Gln Gl - #y Lys Met Cys Asp Ala     #       285     - Asn Tyr Ala Gly Gly Val Val Leu His Pro Ar - #g Thr Ile Ser Leu Glu     #   300     - Ser Leu Ala Val Ile Leu Ala Gln Leu Leu Se - #r Leu Ser Met Gly Ile     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Tyr Asp Asp Ile Asn Lys Cys Gln Cys Se - #r Gly Ala Val Cys Ile     #               335     - Met Asn Pro Glu Ala Ile His Phe Ser Gly Va - #l Lys Ile Phe Ser Asn     #           350     - Cys Ser Phe Glu Asp Phe Ala His Phe Ile Se - #r Lys Gln Lys Ser Gln     #       365     - Cys Leu His Asn Gln Pro Arg Leu Asp Pro Ph - #e Phe Lys Gln Gln Ala     #   380     - Val Cys Gly Asn Ala Lys Leu Glu Ala Gly Gl - #u Glu Cys Asp Cys Gly     385                 3 - #90                 3 - #95                 4 -     #00     - Thr Glu Gln Asp Cys Ala Leu Ile Gly Glu Th - #r Cys Cys Asp Ile Ala     #               415     - Thr Cys Arg Phe Lys Ala Gly Ser Asn Cys Al - #a Glu Gly Pro Cys Cys     #           430     - Glu Asn Cys Leu Phe Met Ser Lys Glu Arg Me - #t Cys Arg Pro Ser Phe     #       445     - Glu Glu Cys Asp Leu Pro Glu Tyr Cys Asn Gl - #y Ser Ser Ala Ser Cys     #   460     - Pro Glu Asn His Tyr Val Gln Thr Gly His Pr - #o Cys Gly Leu Asn Gln     465                 4 - #70                 4 - #75                 4 -     #80     - Trp Ile Cys Ile Asp Gly Val Cys Met Ser Gl - #y Asp Lys Gln Cys Thr     #               495     - Asp Thr Phe Gly Lys Glu Val Glu Phe Gly Pr - #o Ser Glu Cys Tyr Ser     #           510     - His Leu Asn Ser Lys Thr Asp Val Ser Gly As - #n Cys Gly Ile Ser Asp     #       525     - Ser Gly Tyr Thr Gln Cys Glu Ala Asp Asn Le - #u Gln Cys Gly Lys Leu     #   540     - Ile Cys Lys Tyr Val Gly Lys Phe Leu Leu 
Gl - #n Ile Pro Arg Ala Thr     545                 5 - #50                 5 - #55                 5 -     #60     - Ile Ile Tyr Ala Asn Ile Ser Gly His Leu Cy - #s Ile Ala Val Glu Phe     #               575     - Ala Ser Asp His Ala Asp Ser Gln Lys Met Tr - #p Ile Lys Asp Gly Thr     #           590     - Ser Cys Gly Ser Asn Lys Val Cys Arg Asn Gl - #n Arg Cys Val Ser Ser     #       605     - Ser Tyr Leu Gly Tyr Asp Cys Thr Thr Asp Ly - #s Cys Asn Asp Arg Gly     #   620     - Val Cys Asn Asn Lys Lys His Cys His Cys Se - #r Ala Ser Tyr Leu Pro     625                 6 - #30                 6 - #35                 6 -     #40     - Pro Asp Cys Ser Val Gln Ser Asp Leu Trp Pr - #o Gly Gly Ser Ile Asp     #               655     - Ser Gly Asn Phe Pro Pro Val Ala Ile Pro Al - #a Arg Leu Pro Glu Arg     #           670     - Arg Tyr Ile Glu Asn Ile Tyr His Ser Lys Pr - #o Met Arg Trp Pro Phe     #       685     - Phe Leu Phe Ile Pro Phe Phe Ile Ile Phe Cy - #s Val Leu Ile Ala Ile     #   700     - Met Val Lys Val Asn Phe Gln Arg Lys Lys Tr - #p Arg Thr Glu Asp Tyr     705                 7 - #10                 7 - #15                 7 -     #20     - Ser Ser Asp Glu Gln Pro Glu Ser Glu Ser Gl - #u Pro Lys Gly     #               730     __________________________________________________________________________ 
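The integrin binding sequences recited in the claims (FEE and QDE) can be located in the listed sequences once the three-letter residue codes are converted to one-letter codes. The following is a minimal sketch, not part of the original disclosure. The helper names `to_one_letter` and `find_motif` are hypothetical. The two fragments are short excerpts copied from SEQ ID NO:6 (mouse) and SEQ ID NO:8 (human) above, so the reported positions are relative to each fragment and not to the full-length protein.

```python
# Illustrative sketch only; helper names are hypothetical, not from the patent.

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq):
    """Convert a whitespace-separated three-letter sequence to one-letter codes."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter_seq.split())


def find_motif(one_letter_seq, motif):
    """Return the 1-based start positions of every occurrence of motif."""
    positions = []
    start = one_letter_seq.find(motif)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        start = one_letter_seq.find(motif, start + 1)
    return positions


# Excerpts from the disintegrin region of the listed sequences
# (fragments only; positions below are relative to the fragment).
MOUSE_FRAGMENT = "Cys Arg Leu Ala Gln Asp Glu Cys Asp Val Thr Glu Tyr Cys Asn Gly"
HUMAN_FRAGMENT = "Cys Arg Pro Ser Phe Glu Glu Cys Asp Leu Pro Glu Tyr Cys Asn Gly"

mouse = to_one_letter(MOUSE_FRAGMENT)
human = to_one_letter(HUMAN_FRAGMENT)

print("mouse", mouse, "QDE at", find_motif(mouse, "QDE"))
print("human", human, "FEE at", find_motif(human, "FEE"))
```

Applied to a full-length sequence with the numbering debris removed, the same two helpers would report motif positions in the coordinates of the corresponding SEQ ID NO.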

What is claimed is:
 1. A sperm protein in substantially pure form selected from a human PH30 beta chain protein, a mouse PH30 beta chain protein or an amino acid sequence comprising a disintegrin domain of either the human or mouse PH30 beta chain protein; and wherein the sperm protein has an integrin binding sequence selected from FEE or QDE.
 2. The sperm protein of claim 1 which is the human PH30 beta chain protein.
 3. The sperm protein of claim 2, having an integrin binding sequence which is FEE.
 4. A contraceptive composition comprising a therapeutically effective amount of the protein of claim 1 and a pharmaceutically acceptable carrier.
 5. The contraceptive composition of claim 4, wherein the protein is the human PH30 beta chain protein.
 6. The composition of claim 4, wherein the protein is produced by expressing a gene encoding the disintegrin domain of the sperm protein in a recombinant DNA expression vector.
 7. A method of contraception in a human or mouse subject in need thereof, comprising administering to the subject an amount of the sperm protein of claim 1 which is effective for the stimulation of antibodies which bind to the sperm protein in vivo.
 8. An isolated sperm protein made by culturing a transformed host cell which comprises a vector comprising a DNA sequence which encodes a sperm protein selected from a human PH30 beta chain protein as shown in SEQ. I.D. NO. 2 or a mouse PH30 beta chain protein as shown in SEQ. I.D. NO. 4, and isolating the sperm protein.
 9. A sperm protein in substantially pure form selected from a human PH30 beta chain protein comprising SEQ. I.D. NO. 2, a segment of the human PH30 beta chain protein comprising amino acid residues 299-392 of SEQ. I.D. NO. 2, a mouse PH30 beta chain protein comprising SEQ. I.D. NO. 4, or a segment of the mouse PH30 beta chain protein comprising amino acid residues 111-202 of SEQ. I.D. NO. 4.
 10. The sperm protein of claim 9, selected from the human PH30 beta chain protein comprising SEQ. I.D. NO. 2 or the segment of the human PH30 beta chain protein comprising amino acid residues 299-392 of SEQ. I.D. NO. 2.
 11. A contraceptive composition comprising a therapeutically effective amount of the protein of claim 9 and a pharmaceutically acceptable carrier.
 12. A method of contraception in a human subject comprising administering to the subject a therapeutically effective amount of the sperm protein of claim 10.